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This appUcation claims the benefit of U.S. Serial No. 10/1 12,372, filed April 1, 2002, 
U.S. Serial No. 60/382, 614, filed May 24, 2002, U.S. Serial No. 10/164,717, filed June 10, 
2002, U.S. Serial No. 10/167,631, filed June 13, 2002, U.S. Serial No. 10/177,917, filed June 
24, 2002, and U.S. Serial No. 60/399,125, filed 30 July 2002, which are hereby incorporated 
by reference in tiieir entirety. 



Fig. 1 shows the expression of OTB0949 in human tissues. PGR was performed 
using SEQ NO 3 as the forward primer, and SEQ ID NO 4 as the reverse primer. 

Fig. 2 shows the expression pattern of human OTB182, an integral membrane protein, 
in human tissues. To detect gene expression, PGR, was carried out on aliquots of the 
normalized tissue samples using a forward (SEQ ID NO 13) and reverse (SEQ ID NO 14) 
gene-spedfic primers. 

Fig. 3 shows the amino add aUgmnent of human OTBl 82 (SEQ ID NO 12) with 
mouse AK003645 (SEQ ID 15). 

Fig. 4 (A-D) shows the amino add aligmnents of a human transient receptor potential 
cation channel (TRPGC) with related channel femily members. Human sequences are TRPM 
(SEQ ID NO 17), human AB046836 (SEQ ID NO 19), human XM_036123 (SEQ ID NO 
18), and mouse XM_140575 (SEQ ID 20). 

Fig. 5 shows the expression pattern of human TRPGG in human tissues. To detect 
gene expression, PGR was carried out on aliquots of the normalized tissue samples using a 
forward (SEQ ID NO 21) and reverse (SEQ ID NO 22) gene-specific primers. 

Fig. 6 shows the amino acid sequence alignments between different forms of the 
human melanocortin-1 receptor. NM_002386 or MCR-1 A (SEQ ID NO 30). MGR-IG 
(SEQ ID NO 26). MGR-1B(SEQIDN031). 

Fig. 7 shows a schematic of the exon sizes for the melanocortin-1 gme and the 
tubulin gene (exon 7). 

Fig. 8 shows the expression pattern of OTB860 in human tissues. SEQ ID NOS 40 
and 41 are the primer sequences. 

Fig. 9 (A-C) shows the amino add alignments of OTB860 (SEQ ID NO 39) and 
KIAA1678 (SEQ BD NO 42). 



DESCRIPTION OF THE DRAWINGS 
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Fig. 10 (A and B) is the amino add alignmaits of the difiBarent splice variants of 
human TARPP, Brl37A (SEQ ID NO 46), B (SEQ ID NO 48), C (SEQ ID NO 50), D (SEQ 
ID NO 52; SEQ ID NO 14, NM_016300), and E (SEQ ID NO 44), and partial clone 
AL133109 (SEQ ID NO 55). 
5 Fig. 1 1 is a schematic drawing showing the differences between the various forms of 

human TARPP. 

Fig. 12 (A-C) shows amino acid ahgnments of the different spUce variants of human 
TARPP (Brl37A, B, C, D, and E) with mouse TARPP (NM_033264; SEQ ID NO 53). 

The following procedure was used for the expression profile. A twenty-fovir tissue 
10 panel was used (lanes fiom left to rigiht): 1, adrenal gland; 2, bone marrow; 3, brain; 4, 

colon; 5, heart; 6, intestine; 7, pancreas; 8, liver; 9, lung; 10, lymph node; 1 1, lymphocytes; 
12, mammary gland; 13, muscle; 14, ovary; 15, pancreas; 16, pituitary; 17, prostate; 18, skin; 
19, spleen; 20, stomach; 21, testis; 22, thymus; 23, thyroid; 24, uterus. The lane at fhe fer left 
of each panel contains molecular weight standards. The results were obtained according to 
15 the following procedures: 

Polyadenylated mRNA was isolated fijom tissue samples, and used as a template for 
first-strand cDNA synthesis. The resulting cDNA samples were normalized using beta-actin 
as a standard. For the normalization procedure, PGR was performed on aliquots of the first- 
strand cDNA using beta-actin specific primers. The PGR products were visualized on an 
20 ethidium bromide stained agarose gel to estimate the quantity of beta-actin cDNA present in 
each sample. Based on these estimates, each sample was diluted with buffer until each 
contained the same quantity of beta-actin cDNA per imit volume. 

To detect gene expression, PGR was carried out on aliquots of the normalized tissue 
samples using a forward and reverse gene-specific primers. The reaction products were 
25 loaded on to an agarose (e.g., 1.5-2%) gel and separated electrophoretically. 



DESCRIPTION OF THE INVENTION 

The present invention relates to all facets of the novel genes described herein, 
polypeptides encoded by them, antibodies and specific binding partners thereto, and their 
30 applications to research, diagnosis, drug discovery, therapy, clinical medicine, forensic 
science and medicine, etc. The polynucleotides and polypeptides are usefiil in variety of 
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ways, including, but not limited to, as molecular maikers, as drug targets, and for detecting, 
diagnosing, staging, monitoring, prognosticating, preventing or treating, determining 
predisposition to, etc., diseases and conditions, associated with genes of the present 
invention. The identification of specific genes, and groups of genes, expressed in pathways 

5 physiologically relevant to particular tissues, permits the definition of fimctional and disease 
pathways, and the delineation of targets in these pathways which are usefiil in diagnostic, 
therapeutic, and clinical applications. The present invention also relates to methods of using 
the polynucleotides and related products (proteins, antibodies, etc.) in business and 
computer-related methods, e-g., advertising, displaying, offering, selling, etc., such products 

10 for sale, commercial use, licensing, etc. 



OTB0949 

OTB0949 is a polynucleotide (SEQ ID NO 1-2) which is expressed predominantly in 
brain tissue. Low levels of expression are observed in other tissues, e.g., adrenal gland, 

15 mammary gland, pituitary, stomach, and testes, but brain expression is at least 100-fold 
hi^er. See, e.g.. Fig. 1 . Because of its selectivity^ for brain, OTB0949 can be used as a 
molecular marker for brain tissue, e.g., in pathology and cytology, as well as a target, e.g., to 
ablate brain tissue, to deliver drugs to brain cells, etc. Jn the brain, OTB0949 is highly 
expressed in amygdala, hippocampus, thalamus, and retina. OTB0949 can also be a usefiil in 

20 diagnostics and therapeutics to treat neurological and visual disorders. 

The brain is one of the most complicated and least understood organs in the 
mammalian body. Anatomically, it is composed of four different regions: (1) cerebral 
hemispheres, (2) diencephalon (thalamus, hypothalamus, and qpithalmus), (3) brain stem 
(midbrain, pons, and medulla oblongata), and (4) cerebellimi- These can be further 

25 subdivided. For instance, the cerebral hemispheres contain cerebral cortex and basal ganglia 
(caudate nucleus, putamen, globus pallidus, lentiform nucleus, corpus striatum, amygdala). 
The midbrain contains, e.g., cerebral peduncles, corpora quadrigemina, colliculi, substantia 
nigra, and the red nucleus. Other regions and subdivisions of interest include hypothalamus, 
pituitary, cranial nerves, pineal, gray matter, white matter, raphe nucleus, limbic system, etc. 

30 Various cell types are found in the brain, including, supporting cells, such as neuroglia, glia, 
astrocytes, microglia, ependymal cells, oligodendrocytes, and Schwann cells, neurons, such 
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as multipolar, bipolar, unipolar, Purkiaje, and pyramidal cells. 

The gene is located at chromosomal position 12q24.2, Several neurological diseases 
have been mapped to this region, e.g., spinocerdjellar ataxia 2 (associated with a mutation in 
the ataxin-2 gene), spinal muscular atrophy (OMIM 158590), and amyotrophy (OMIM 
5 181405). Disruption of OTB0949 (e.g., in the corresponding gene a transgenic animal) can 
result in a brain disorder or susceptibility thereto, including those mentioned above. 

OTB0949 is coded for in a single exon, and comprises a short coding sequence of 
about 135 amino adds (SEQ ID NO 2). It contains a stretch of hydrophobic amino acids 
ftom about positions 76-100 (SEQ ID NO 2). Examples of promoters include, e.g., SEQ ID 

10 NOS 5-10 located in contig NT_009775 at about 557-607, 1263-1313, 1591-1641, 1635- 
1685, 1714-1764, and 1936-1986, respectively. 

The preswit invention also relates to polypeptides of OTB0949, e.g., an isolated 
human OTB0949 polypeptide comprising or having the amino acid sequence set forth in SEQ 
ID NO 2, an isolated human OTB0949 polypqptide comprising an amino acid sequence 

1 5 having 80, 85, 90, 95, 97, 99% or more amino acid sequence identity to the amino add 
sequence set forth in SEQ ID NO 1 , and optionally having one or more of OTB0949 
activities, such as cell signaling activity, secretory pathway activity, etc. Fragments spedfic 
to OTB0949 can also tised, e.g., to produce antibodies or other immune responses, as 
competitors to any of its activities, etc. These fragments can be referred to as being "specific 

20 for" OTB0949. The latter phrase, as akeady defined, indicates that the peptides are 

characteristic of OTB0949, and that the defined sequences are substantially absent from all 
other protein types. Such polypeptides can be of any size which is necessary to confer 
spedfidty, e.g., 5, 8, 10, 12, 15, 20, etc. 

25 OTB182 

OTBl 82 is an integral membrane protein comprising 307 anoino adds (SEQ ID NO 
12). Using SMART (e.g., Schultz et al., Proc. Natl Acad. ScL, 95:5857-5864, 1998; Letunic 
et al.. Nucleic AcidRes.y 30:242-244, 2002), the protein is predicted to have seven membrane 
spanning regions at about amino adds positions 45-67, 87-109, 129-151, 161-183, 196-218, 
30 231-253, and 278-297. There is a putative signal sequence (at amino adds 1-26) at its N- 
tCTninus which also overlaps with an eight transmembrane spanning domain (at amino adds 
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10-32). 

OTB182 is expressed predominantly in excitable tissues, e.g., brain, heart, and 
muscle, with very low expression observed in prostate tissues. See. Fig. 2. In the brain, it is 
expressed predominantly in thalamus. It also expressed in neural stem cells. Its expression 

5 in excitable tissues makes OTB 1 82 a highly selective marker for excitable tissues, as well as 
indicating a functional and/or developmental role in this tissue type. There are a number of 
genes whose expression is restricted predominantly to excitable cells, e.g., GABA-interacting 
factor-1 (GRIF-1; Beck et al., J, Biol Chem., 28 May 2002); calcium cahnodulin dependent 
(CaM) kinase (e.g., Loseth et al.. Brain Research, 869(1 -2): 137- 145, 2000); sodium channel 

10 types (e.g,, Schaller et al., J. Neurosci, 12(4):1370-81, 1992); calcium dependent 

mitochondrial solute carrier (e.g., Del Arco et al., J, Biol. Chem., 273(36):23327-23334, 
1998). 

A mouse homolog of OTB182 is AK003645 (SEQ ID NO 15) that codes for a 153 
amino add polypeptide. It shares about 92% sequence identity from amino acids 1-122, and 

15 sharply diverges from that point onward. See. Fig. 3. Murine OTB182 maps to 

chromosomal location 1 1E2. The present invention relates all transcripts associated with the 
AK003645 gene loci. The degree of nucleotide sequence identity between human and mouse 
(AK003645) is about 83% from about nucleotide position 71-437 of SEQ ID NO 12. 
OTB 182 is located at chromosomal band 17q25. A genetically-inherited 

20 neuromuscular disease, hereditary neuralgic amyotrophy (HNA) has been mapped to this 
locus. See, e.g., Jeannetetal.,iVeMro/ogy, 57:1963-1968, 2001. hi addition, a hereditary 
hearing loss maps to 17q25 (e.g., DFNA20, Morell et al., Genomics, 63:1-6, 2000) and 
mental retardation (Rio et al.. Human Genetics, 108:51 1-515, 2001). Examples of specific 
polynucleotides are SEQ ID NOS 13 and 14. 

25 Diseases or disorders which can be treated in accordance with the present invention 

include, but are not limited to neuropathy, neuralgic amyotrophy (e.g., HNA), myopathy, 
sensorineural hearing loss (e.g., DFNA20), mental retardation, neuromuscular disorders, 
brain cancer, such as a neuroblastoma, and other diseases and conditions involving heart, 
brain, and muscle tissues, etc. 

30 A transgenic animal with OTBl 82 fbnctionally disrupted can show a defect in an 

excitable cell. Such defect, includes, e.g., developmental defects, defects in the functional 
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activity of the cell, e.g., in excitability, membrane conductance, response to stimuli, signal 
transduction, or any of the disorders mentioned herein. 

Antibodies to OTB 1 82 can also be produced, e.g., an antibody which is spedfic-for: 
an epitope selected from amino acids 123-307 of SEQ ID NO 12, or comprising amino acid 
5 27, 47, 64, 66, 75, 78, 105, 1 1 1, or 1 13 of SEQ ID NO 12 

Human transient receptor potential cation channel (TRPCC) gene and polypeptide 

Human TRPCC codes for a polypeptide of 1707 amino acids. As shown in Fig. 5, it 
is selectively expressed in brain, kidney, and pituitary, with very low expression observed in 

1 0 testis and ovary. By the phrase "selectively expressed," it is meant that a nucleic add 

molecule, when produced as a transcript, is characteristic of the tissue or cell-type in which it 
is made. This can mean that the transcript is expressed only in that tissue and in no other 
tissue-type, or it can mean that the transcript is expressed preferentially, differentially, 
predominantly, and more abundantly (e.g., at least 5-fold, 10-fold, etc., or more) in that tissue 

1 5 when compared to other tissue-types. 

The nucleotide and amino acid sequences of human TRPCC are shown in SEQ ID 
NOS 16 and 17. Analysis of its primary structure indicates the presence of six 
transmembrane domains at about amino acids 870-892, 901-1112, 904-921, 936-958, 971- 
990, 1005-1024, 1085-1 107 of SEQ ID NO 17, however, by analogy to other ion channels, it 

20 is generally believed to have only six transmembrane spanning regions. See, e.g., Clapham et 
al., Nature Reviews, Neuroscience, 2:387, 2001 . The ion transport domain comprises amino 
adds 901-1 1 12. There is also a putative transmembrane domain at the N-terminus at about 
amino acids 5-24. According to the six-transmembrane model, both the N- and C-terminus 
of the protein are intracellular, and provide a scaffolding for interaction with other proteins. 

25 The human TRPCC contains 25 exons. The present invention relates to any isolated 

introns and exons that are present in the gene, hitron and exon boundaries can be routinely 
determined, e.g., using the sequences disclosed herein. 

Partial sequences for human TRPCC were previously identified (e.g.. Accession 
Numbers AB046836 and XM_036123). For example, human AB046836 (SEQ ID 19) is 

30 incomplete, coding for 1017 amino acids (See Fig. 4, AB046836), and lacks the first 690 
amino acids of human TRPCC, but shares about 99% identity with TRPCC along the rest of 
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its length. Another partial sequence, human XM_036123 (SEQ ID NO 1 8) codes for 988 
amino adds (See Fig 4, XM_036123), lacking flie first 719 amino acids of human TRPCC, 
butshares 100% identity with TRPCC along the rest ofits length (See Fig 4). XM_140575 
(SEQ ID NO 20) appears to be a homolog of human TRPCC, and shares about 94% sequaice 
5 identity from about amino acids 82-693, or about amino acids 345-956 of human TRPCC 
(SEQ ID NO 17). Amino acids 1-81 and 694-736 (see Fig. 4) of the mouse homolog have 
low sequence identity with human TRPCC. Aligmnent with mouse genomic DNA using 
Spidey (NCBI) indicates that amino acids 1-80 of XM_1 40575 are derived from exons 1 and 
2 of the genomic DNA, and amino acids 694-736 are derived from exon 7 of the mouse 

1 0 genomic DNA. XM_140575 is located on mouse chromosome 19B. 

TRPCC maps to chromosomal region 9q21.L Strikingily, hypomagnesemia with 
hypocalcemia (OMIM 602014) are known to be detemiined by a miitation wiflun 9q21 
(Walder et al.. Human Molecular Genetics, 6: 1491-1497, 1997), as would be expected wifli 
a channel responsible for cation conductance. Consistent with its expression in brain, a 

1 5 susceptibilify to amyotrophic lateral sclerosis with frontotemporal dementia (OMIM 105550) 
was mapped to this same chromosomal locus (Pinsky et al.. Clinical Genetics, 7:186-191, 
1975; Hosier et al., JAMAy 284:1664-1669, 2000). In addition, schizophrenia (Hovattaet al.. 
Am, J, Hum. Genet, 65:1 1 14-24), and familial dyskinesia/facial myokymia (Fernandez et al., 
Ann, Neurol, 49:486-92, 2001) are also associated with this gene locus. Nucleic acids of the 

20 present invention can be used, e.g., as linkage markers, diagnostic targets, and therapeutic 
targets for any of the mentioned disorders, as well as any disorders or genes mapping in 
proximity of TRPCC. 

TRCC polynucleotides, polypeptides, ligands, and binding partners thereto, can be 
used in a number of useful ways. For example, binding partners, such as antibodies and 

25 ligands, can be used to selectively target agents to brain, kidney, and other tissues in which it 
is expressed for purposes including, but not limited to, imaging, diagnostic, therapeutics, etc. 
Imaging of tissues can be facilitated using agents such as TRPCC antibodies that can be used 
to target contrast agents to a specific site in the body. Various imaging techniques have been 
used in this context, including, e.g.. X-ray, CT, CAT, MRI, ultrasound, PET, SPECT, and 

30 sdntographic. A reports agent can be conjugated or associated routinely with a TRPCC 

antibody. Ultrasound contrast agents combined with ligands such as antibodies are described 
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in, e.g., U.S. Pat. Nos 6^64,917; 6,254,852; 6,245,318; and 6,139,819. MRI contrast agents, 
such as metal chelators, radionucleotides, paramagnetic ions, etc., combined with selective 
targeting agents are also desaibed in the literature, e.g., in U.S. Pat Nos. 6,280,706 and 
6^21 ,334. The methods described therein can be used generally to associate TRPCC and 

5 Ugands thereof with an agent for any desired purpose. 

An active agent can be associated in any manner with an TRPCC Ugand that is 
effective to achieve its deUvery to the target. The association of the active agent and the 
ligand ("coupling") can be direct, e.g., through chemical bonds between the binding ligand 
and the agent or via a linking ag^t, or the association can be less direct, e.g., where the 

1 0 active agent is in a liposome, or oihec carrier, and the ligand is associated with the liposome 
surface. Li such case, the ligand can be oriented in such a way that it is able to bind to 
TRPCC on the surfaces of kidney or brain cells. 

Useful human TRPCC polypeptides and corresponding nucleic adds include 
polypeptides comprising amino acids 1-88, 5-24, 1-690, 1-719, and fragments tiiereof (See 

15 SEQ ID NO 17 and Fig. 4). Nucleic adds and polypeptides can be used as probes (e.g., in 
PCR, in Northem blots, etc.), as diagnostic agents, to geaemte antibodies, as vaccines, to 
produce recombinant proteins, as antisense, etc. A spedfic polynucleotide according to the 
present invention can be determined routinely. Examples are specific probes are SEQ ID 
NOS 21-24, e.g., where SEQ ID NOS 23 and 24 can be used as forward and reverse PCR 

20 primers, respectively, to amplify a portion of amino acid region 1-1 60 of SEQ ID NO 17. 

TRPCC has a number of biological activities, including, e.g., cation transport, signal 
transduction, protein binding, etc. By "signal transduction" is meant the activation of a 
chain of events that alters the concentration of one or more small intracellular signaling 
molecules (second messengers), e.g., cyclic AMP, calcium ions, as described in Sabala et al., 

25 British Journal of Pharmacology, 132:393-402, 2001 . By "cation transport" is meant Ihe 
influx or e£Qux of a cation, e.g., calcium, magnesium, into or from a cell. Mizuno et al.. 
Molecular Brain Research, 64:41-51, 1999. Protein binding indicates the ability of the 
protein to interact with other proteins, e.g., as the N-terminus interacts with intracellular 
proteins. These activities can be determined routinely. Signal transduction can be assessed 

30 by expression of TRPCC in cells, etc., and measurement of the concentrations of elicited 

second messengers or byproducts, e.g., Ca^^ or Mg^^ or cAMP, inositol, etc., by, e.g., atomic 
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absoiption spectrometry (ThemioElemaital SOLAAR AA spectrometers), 
radioimmmoassay, etc. Sano at al. Science, 293:1327-1330, 2001. Cation transport can be 
assessed by measurement of changes in ionic currents by whole-cell patch-clamp analysis. 
For instance, cells or oocytes can be transfected with a polynucleotide of the present 

5 invention and then analyzed for expression of calcixim chaimel activity, e.g., using patch 
clamp, calcium activated dyes, etc.. See, also, e.g., Strubing et al.. Neuron, 29:645-655, 
2001; Sano et al.. Science, 293:1327, 2001; Ohki et al., J. Biol Chem., 275:39055-39060, 
2000; Boulay et al., J, Biol Chem,, 272:29672-29680, 1997. 

The present invention relates to an isolated polynucleotide comprising, e.g., a 

10 polynucleotide sequence coding without interruption for a human TRPCC polypeptide, or 
complement thereto, said TRPCC havmg 80%, 85%, 90%, 92%, 95%, 99%, or more amino 
add sequence identity along its entire length to the sequence comprising amino acids 1-690 
of SEQ ID NO 17, and 80%, 85%, 90%, 92%, 95%, 99%, or more amino acid sequence 
identity along its entire lengtii to the sequence comprising from amino acids 691-1707 of 

15 SEQ ID NO 17, and which has, e.g., cation transport, signal transduction, or protein bmding 
activity. 

Antibodies can be prepared against specific epitopes or domains of TRPCC, e.g., 
amino acids 2-30, 773-789, 870-887, 905-913, 943-958, 969-986, 1005-1022, 1087-1114, 
1125-1131, 789-870, 913-943, 986-1005, etc. 

20 Detection can be desirable for a variety of different purposes, including research, 

diagnostic, prognostic, and forensic. Diagnostic purposes included testing patients and their 
&milies for the presence of mutations associated with hypomagnesemia with hypocalcemia 
or amyotrophic lateral sclerosis with fix)ntotemporal dementia. The selected mutant alleles, 
SNPs, polymorphisms, etc., can be used diagnostically to determme whether a subject has, or 

25 is susceptible to a disorder associated with TRPCC, as well as to design therapies and predict 
the outcome of the disorder. Methods involve, e.g., diagnosing a disorder associated with 
TRPCC or determining susceptibility to a disorder, e.g., hypomagnesemia with hypocalcemia 
or amyotrophic lateral sclerosis with frontotemporal dementia, comprising, detecting the 
presence of a mutation in a TRPCC gene (such as a mutation in SEQ ID NO 16, or variants 

30 tiiereof. The sequences of TRPCC genes can also be compared, e.g., between a normal gene 
as shown in SEQ ID NO 16 and the sequence of a gene from a patient with the disorder, e.g.. 
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hypomagnesemia with hypocalcemia. 

Fragments specific to TRPCC can also be used, e.g., to produce antibodi^ or other 
immune responses, as competitors to nucleotide binding, ligand binding, etc. or as, e.g., 
inhibitors or stimuli in signal transduction pathways. These fragments can be referred to as 
5 being "specific for'' TRPCC. The latter phrase, as akeady defined, indicates that the peptides 
are characteristic of TRPCC, and that flie defined sequences are substantially absent from all 
other protein types. Such polypeptides can be of any size necessary to confer specificity, e.g., 
5, 8, 10, 12, 15, 20, etc. Examples of polypeptides include but are not limited to polypeptides 
that comprise the following amino acid residues: 2-60, 598-660 of SEQ ID NO 17, or 

1 0 fiagments thereof. 

Biological activities of TRPCC include, e.g., cation chaimel activity, signal 
transduction activity, and protein binding activity. As discussed above, the biological activity 
of TRPCC can be measured routinely. For example, if agents are to be identified which 
modulate the channel activity of TRPCC either electrophysiology or caldam imaging can be 

1 5 used to assess their eflfects, e.g., using fluo-3, Fura-red, Ca-sensitive chemi-luminescent 
probes, etc. (e.g., kits are connnercially available from Molecular Probes) and a laser 
scanning confocal microscope to visualize the changes in intracellular calcium as a result of 
modulation of TRPCC. 

A transgenic animal, or animal cell, lacking one or more fimctional TRPCC genes can 

20 be useful in a variety of applications, including, as an animal model for hypomagnesemia 
with secondary hypocalcemia, amyotrophic lateral sclerosis with frontotemporal dementia, 
etc., drug screening assays (e.g., for signal transduction mediated by agents other than 
TRPCC; by making a cell deficient in TRPCC, the contribution of other receptors to, e.g., 
Ca^**" modulation can be specifically examined), as a source of tissues deficient in TRPCC 

25 activity, etc. Such an animal can show a defect in cation (e.g., calcium) conductance, e.g., an 
impairment in the permeation of an ion through the channel. 

An isolated polynucleotide can comprise, e.g., a polynucleotide sequence coding 
wifliout interruption for a human TRPCC polypeptide, or complement thereto, said TRPCC 
having 90% or more amino acid sequence identity along its entire length to the sequence 

30 comprising amino acids 1-690 of SEQ ID NO 17, and 90% or more amino acid sequence 
identity along its entire lengtii to the sequence comprising from amino acids 691-1707 of 
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SEQ ID NO 1 7, and which has cation transport activity. 

The present invention also relates to a methods of identifying a mutation associated 
with amyotrophic lateral sclerosis with jBx>ntotemporal dementia, comprising: comparing the 
structure of. genomic DNA comprising all or part of human TRPCC, mRNA comprising all 
5 or part of human TRPCC, cDNA comprising all or part of human TRPCC, or a polypeptide 
comprising all or part of human TRPCC, with the complete structure of human TRPCC as set 
forth in SEQ ID NO 16, in a patient having amyotrophic lateral sclerosis with frontotemporal 
demratia, or a family member thereof. 



10 Melanocortin 

The present invention relates to novel forms of a melanocortin- 1 receptor (also known 
as **MCR-1" or alpha-melanocyte stimulating hormone receptor). It is highly expressed in 
melanocytes, and is a key component of the pathway which modulates skin and hair 
pigmentation. Moreover, certain alleles of MCR-1 are associated with a high risk of 

1 5 melanoma. MCR-1 is also expressed in other tissues, including monocytes, mast cells, 
placenta, pituitary, and endothelial cells. 

MCR-1 belongs to the G-protein coupled receptor (GPCR) super-family. Its 
expression is restricted to melanocytes and few other cell types, such as monocytes, mast 
cells, and endothelial cells. See, e.g.. Smith et al., Gene, 281:81-94, 2001; Scholzen et al,, 

20 Annals oftheNew York Academy of Sciences, 885:239-253 (1999). Stimulation of the 
receptor by its natural ligands (e.g., alpha-melanocyte stimulating hormone or "a-MSH") 
causes an increase in cAMP levels which, in tum, stimulates intracellular tyrosinase activity. 
Increased activity of the tyrosinase enzyme drives the conversion of phaeomelania (yellow 
and red pigments) to eumelalanin (brown and black pigments). 

25 The MCR-1 gene is located at chromosomal position 1 6q24. It is adjacent to the 

tubulin TUBB4 gene, and its 3* region overlaps witii the tubulin promoter (Smith et al.). 
Transcripts containing genie material from both MCRl and TUBB4 have been identified, 
including transcipts which contain coding sequences from both. See, eg., NCBI accession 
number BC02017L These may be involved in cancer. 

30 Almost 40 different polymorpWsms in the MCR receptor have been identified See, 

Sturm et al,. Gene, 277:49-62, 2001; Table 1. Several of these (e.g,, ArglSlCys; Argl60Trp; 
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Asp294His) are strongly associated with red hair, feir skin, and poor tanning ability. It has 
been reported that fliese alleles are nonfunctional receptors and do not stimulate cAMP 
production when stimulated by MSH, See, Table 2. As a consequence, phaeomelanin is not 
converted to eumelalanin, and skin and hair color reflect the cell's high content of flie yellow 

5 and red phaeomelanin pigments. Significantly, individuals who have these alleles are also at 
a higher risk for skin cancers, such as basal cell carcinoma, squamous cell carcinoma, and 
melanoma. See, e.g., Sturm et al., Am. J, Hum, Genet., 6 (supplement to volume 67): 16, 
Oct 2000. See, also OMIM, No. 1 55555 for other information on MCR-1, including disease 
information, polymorphisms, etc. 

10 The present invention relates to novel forms of MCR-1 . In one embodiment, the 

present invention relates to a novel MCR-1 variant, MCR-1 type C or MCR-IC, which 
possesses a unique carboxy-terminus. Previous reports had identified a 3 17 amino acid form 
of MCR-1 CMCR-l A") m a number of different species, including human (SEQ ID NO 30), 
chimpanzee, muskox, sheep, cow, horse, dog, and fox. This form was characterized as fuU- 

1 5 length. Several minor size variants were observed, as well, e.g., in mouse (315 amino add 
acids), in pig (310 amino acids), and in cow (321 amino adds). A second form, MCR-IB, 
was (SEQ ED NO 31) also reported that had an additional 65 amino acids at its terminus (Tan 
etal.,F£B5i:e//er^, 451:137-141, 1991; WOOO/39147). The present invention relates to a 
third form of MCR-1 (MCR-IC) that comprises 32 carboxy-amino acids (amino adds 367- 

20 398 of SEQ ID NO 26) not previously identified in any melanocortin receptor variant. This 
novel form comprises part of tiie new carboxy temiinus identified in MCR-IB, but diverges 
from it at amino acid position 367. See, Fig. 6. 

Fig. 7 shows exons which have been detected in melanocortin- 1 receptors. Exons 1, 
2, and 3 contain MCR coding sequences; Exons 5, 6, and 7 contain tubulin coding sequences. 

25 MCR-IA (e.g., NM_002386: SEQ ID NO 30) contains exon 1, and MCR-IB contains 

exons 1 and 2. MCR-IC contains coding sequence from exons 1-3. As indicated by the stop 
codon TGA, exon 3 comprises both coding and noncoding sequence. MCR-IC can also 
contain noncoding sequences, e.g., exons 4, 5, 6, and/or 7 (e.g., for a total of exons 1-7). 
BC020171, mentioned above, contains the coding sequence from exons 1 and 2 fused to the 

30 coding sequences of tubulin in exons 5-7. Examples of promoters for MCR-IC include, e.g., 
SEQIDNOS 35-37. 
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The preset invention also relates to a polymorphism at amino acid position 120, 
where an isoleucine (I) is replaced with a threonine (T). Isoleucine is present at amino add 
position 120 in most melanocortin recq)tor-l homologs, except pig which has a methionine 
substitution. This polymorphism may affect the receptor's functionality. Analysis of tiie 
5 transmembrane structure using TMHMM v. 2.0 (Krogh et al.. Journal of Molecular Biology^ 
305(3):567-580, January 2001; Sonnhammer et al., In J. Glasgow et al., editors, Proceedings 
of the Sixth International Conference on Intelligent Systems for Molecular Biologyy pages 
175-182, Menlo Park, CA, 1998. AAAI Press; MoUer et al., Bioinformatics, 17(7):646-653, 
July 2001) reveals a different number of predicted transmembrane sequences than the 

1 0 isoleucine isoform. 

The present invention relates to any polynucleotide, or polypeptide encoded thereby, 
which codes for MCR-IC, including receptors having any polymorphism, such as the 
naturally-occuning polymorphisms listed in Tables 1 and 2, and those disclosed herein. 
Examples include SEQ ID 27 having 120T and 163Q, SEQ 28 having 1201 and 163R, and 

15 SEQ ID 29 having 1201 and 163Q. It also includes polynucleotide and polypeptide fragments 
which are specific forMCR-lC (e.g., 367-398 of SEQ ID NO 26 and fragments thereof), and 
polynucleotides and polypeptides which comprise such specific fragmmts. For example, flie 
present invention relates to a polynucleotide comprising a coding sequence for amino acid 
367-398 of SEQ ID NO 26, or fragments thereof, such as any five amino add sequence 

20 contained therein. 

The present invention also relates to an isolated polynucleotide comprising, a 
polynucleotide sequence coding without interruption for a human MCR-IC, said MCR-IC 
having about 80%, 85%, 88%, 89%, 90%, 92%, 95%, 99%, etc., or more amino acid 
sequence identity along its entire length to the amino acid sequence set forth in SEQ ID NO 

25 26, or a complement tiiereto, and which has Ugand-binding activity, G-protein binding 

activity, or cAMP production activity. For example, such a polynucleotide can comprise one 
or more of the polymorphisms listed in Tables 1 and 2 (e.g., if 36 of the listed 
polymorphisms were present in such a polynucleotide, it would have about 90% (360/398) 
sequence identity along its entire length to the amino acid sequmce of SEQ ID NO 26. A 

30 corresponding amount of nucleotide is included, e.g., 90%, 92%, 95%, 97%, 98%, 99%, or 
more. 
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Similarly, the present invention relates to an isolated polynucleotide comprising, a 
polynucleotide sequence coding without interruption for a human MCR-IC, or complement 
thereto, said MCR-IC having 80%, 84%, 85%, 86%, 88%, 90%, 95%, or more amino add 
sequence identity along its entire length to the sequmce comprising amino acids 1-316 of 
5 SEQ ID NO 26, and 85%, 90%, 92%, 95%, etc. or more amino acid sequence identity along 
its entire length to the sequence comprising from amino acids 317-398 of SEQ ID NO 26, 
and which has ligand-binding activity, G-protein binding activity, or cAMP production 
activity. 

As stated, a polynucleotide can code for a polypeptide having one or more of the 

10 following activities, ligand-binding activity, G-protein binding activity, cAMP production 
activity, or o&er functional activities. Ligand-binding activity indicates the ability of MCR- 
1 C to bind specifically to a receptor ligand, such as a ACTH, MSH, etc. Ligand binding 
activity can be using a radioactive or otherwise labeled ligand, or whole-cell assays using 
labeled Ugands. See, e.g., WO0039147, W09957148, U.S. Pat. Nos. 5,731,408, 6,100,048, 

15 and 6,350,760; Libert et al., Pigment Cell Res., 2:510-518, 1989. 

G-protein binding activity indicates the ability of the receptor protein to bind to a G- 
protein. Such binding does not determined routinely, e.g., using filtration assays necessarily 
have to be productive, i.e., the binding does not have to result in stimulation of the signal 
transduction cascade. G-protein binding can be measured using in vivo and in vitro binding 

20 assays, as well as functional assays. See, e.g.. Ford et al.. Science^ 280:1271-1274, 1998. 
cAMP production is a measure of the ability of the receptor to stimulate the 
generation of cAMP upon binding by a receptor agonist. MCR is known to couple to G- 
proteins and thereby activate adenyl cyclase, increasing intracellular levels of cAMP (e.g., 
Buckley & Ramachandran, Proc. Natl Acad, ScL, 78: 7431-7435, 1981; Grahame-Smith et 

25 al., 1967, J. Biol Chem. 242: 5535-5541; Mertz & Catt, 1991, Proc. Natl Acad Sci. 88: 
8525-8529; Pawalek et al., 1976, Invest Dermatol 66: 200-209). This property of cells 
expressing the MCR-IC can be used assess its ^^cAMP production activity." For example, 
cells can be transfected with MCR-IC DNA, plated, and washed once with DMEM 
containing 1% bovine serum albumin (BSA) and 0.5 mM IBMX (a pho^hodiesterase 

30 inhibitor). The cells can then be treated with hormone (e.g., alpha-MSH, gamma-MSH, 

ACTH, etc.). Following hormone treatment, tiie cells can be washed with phosphate bujSered 
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saline, and intracellular cAMP extracted by lysing the cells. Ihtracellnlar cAMP 
concentrations can be determined routinely, e.g., using an assay (Amersham) which measures 
the ability of cAMP to displace cAMP from a high affinity cAMP binding protein (see 
Oilman, 1970, Proc. Natl Acad. ScL, 67: 305-312). 

5 Polynucleotide and polypeptides of the presrat invention can be used for a variety of 

purposes, including, but not limited to, treating cancers, treating skin cancer and other 
cancers modulating skin and hair pigmentation, identifying MCR ligands, modulating the 
MCR-1 receptor types, determining susceptibility to skin cancer, detecting MCR-IC 
expression, detennining polymorphisms in MCR-1 C, making MCR-1 C polypeptide, 

10 expressing MCR-IC in host cells, making antibodies to MCR-1 receptor types, modulating 
cutaneous inflammation (see, e.g., Bhardwaj et al,, J. Immunol^ 158:3378-3384, 1997; Luger 
et al., Ann, NY Acad. ScL, 917:232-238, 2000), modulating melanocytes, monocytes, 
endothelial cells, or other cells in which MCR-IC is expressed, etc. 

The expression of MCR-IC on the surface of melanoma cells makes it a useful target. 

1 5 Melanoma is a skin cancer which originates from melanocytes present normally in the 

epideraiis and underlying cell layers. There are four basic types: lentigo maligma melanoma, 
superficial spreading melanoma, nodular melanoma, and acral lentigous melanoma. Because 
of its expression on melanocytes, MCR-IC specific antibodies and other binding partners can 
be used to treat melanoma, e.g., by conjugating cytotoxic agents to antibodies directed to the 

20 receptor. In addition, MCR-IC polynucleotides, polypeptides, and binding partners thereto 
can be used to detect metastatic melanoma cells. 

Modulation of the MCR-IC can also be used to modulate skin pigmentation, e.g., to 
increase the amovmt brown and black pigments to darken skin color, to provide protective 
effects against UV radiation, to block receptor activation, e.g., preventing or reducing the 

25 accumulation of brown and black pigments in the skm, preventing or reducing tanning, 
preventing or reducing skin freckling, etc. Agonists and antagonists of the melanocortin 
receptor, include, alpha-melanocyte stimidating hormone and adrenocorticotropic hormone. 
Other Ugands are disclosed in, e.g., W09957148, U.S. Pat. Nos. 5,731,408, 6,100,048, and 
6,350,760, and can be identified and isolated as described in these patents, as well as 

30 WO0039147. 

As discussed earlier, several MCR-1 alleles have been associated with a greater risk 
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of skin cancer. For example, the presence of flie Asp84Glu variant mq)Osed a higli risk of 
melanoma in individual carriers. See, Kennedy et al., J. Invest Dermatol^ 1 17:294-300, 
2001. Other alleles with increased risk of melanoma included, Val60Leu, Val92Met, 
Argl42ffis, ArglSlCys, Argl60Trp, Argl63Ghi, and His260Pro (Kennedy et al.). See, also, 

5 Scott et al., /. Cell Scu, 1 15 (Pt 1 1):2349-2355, 2002. MCR-IC of the present invention can 
be used to assess melanoma risks, e.g., detOTnining the presence of a variant of MCR-IC in 
individuals, and whether such variants are associated with skin cancer and oihsx melanocyte 
disorders. Analysis can be performed by any suitable method, e.g., by single-stranded 
conformation polymorphism analysis and DNA sequence analysis. 

10 Expression can also be "selective," where expression is observed. By the phrase 

"selectively expressed/' it is meant that a nucleic acid molecule comprising the defined 
sequence of nucleotides, when produced as a transcript, is characteristic of the tissue or cell- 
type in which it is made. This can mean that the transcript is expressed only in that tissue 
and in no other tissue-type, or it can mean that the transcript is expressed preferentially, 

1 5 differentially, and more abundantly (e.g., at least 5-fold, 1 0-fold, etc., or more) in that tissue 
when compared to other tissue-types. 

In view of their selectivity and display on the cell surface, MCR-IC polypeptides of 
the present invention are a useful target for histological, diagnostic, and therapeutic 
applications relating to the cells in which they are expressed. Antibodies and other protein 

20 binding partners (e.g., ligands, aptamers, small peptides, etc.) can be used to selectively target 
agents to a tissue for any pvirpose, included, but not limited to, imaging, therapeutic, 
diagnostic, drug delivery, gene therapy, etc. For example, binding partners, such as 
antibodies, can be used to treat melanomas in analogy to how c-erbB-2 antibodies are used to 
breast cancer. They can also be used to detect metastatic cells, in biopsies, etc. The genes 

25 and polypeptides encoded thereby can also be used in tissue engineering to identify tissues as 
they appear during the differentiation process, to target tissues, to modulate tissue growth 
(e.g., firom starting stem cell populations), etc. Usefiil antibodies or other binding partners 
include those that are specific for parts of the polypeptide which are exposed extracellularly. 
Any of the methods described above and below can be accomplished in vivo, in vitro, or ex 

30 vivo. 

Binding partners can also be used as to specifically deliver therapeutic agents to a 
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tissue of interest For example, a gene to be delivered to a tissue can be conjugated to a 
binding partner (directly or through a polymer, etc.), in liposomes comprising cell sur&ce, 
and then administered as appropriate to the subject who is to be treated. Additionally, 
cytotoxic, cytostatic, and other therapeutic agents can be delivered specifically to the tissue to 
5 treat and/or prevent any of the conditions associated with the tissue of interest. 

The present invention relates to methods of detecting melanoma cells, comprising one 
or more of the following steps, e.g., contacting a sample comprising cells with a 
polynucleotide specific for MCR-IC (e.g., amino acids 367-398, and Augments thereof), or a 
mammalian homolog thereof, under conditions effective for said polynucleotide to hybridize 

1 0 specifically to said gene, and detectmg specific hybridization. Detecting can be 

accomplished by any suitable method and technology, including, e,g., any of those mentioned 
and discussed below, such as Northern blot and PGR. Specific polynucleotides include SEQ 
JD NOS 32-34, and complements thereto. 

As indicated above, binding partners can be used to deUver agents specifically to 

1 5 melanocytes, e.g., for diagnostic, therapeutic, and prognostic purposes, including the 

treatment of melanoma. Methods of deUvering an agent to a melanocyte cell can comprise, 
e.g., contacting a melanocyte with an agent coupled to binding partner specific for a 
melanocortin receptor gene of the present invention, whereby said agent is delivered to said 
cell. Any type of agent can be used, including, therapeutic and imaging agents. Contact wifli 

20 the melanocyte (e.g., a melanoma) can be achieved in any effective manner, including by 

administering effective amounts of the agent to a host orally, parentally, locally, systemically, 
intravenously, etc. The phrase "an agent coupled to binding partner^' indicates that the agent 
is associated with the binding partner in such a manner that it can be carried specifically to 
the target site. Coupling includes, chemical bonding, covalent bonding, noncovalent bonding 

25 (where such bonding is sufficient to carry the agent to the target), present in a liposome or in 
a lipid membrane, associated with a carrier, such as a polymeric carrier, etc. The agent can 
be directly linked to the binding partner, or via chemical linkers or spacers. 

hnagmg of specific organs can be facilitated using tissue selective antibodies and 
other binding partners that selectively target contrast agents to a specific site in the body. 

30 Various imaging techniques have been used in this context, including, e.g.. X-ray, CT, CAT, 
MRI, ultrasound, PET, SPECT, and scintographic. A reporter agent can be conjugated or 
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/ associated routinely with a binding partner. Ultrasound contrast agents combined with 
binding partners, such as antibodies, are described in, e.g., U.S. Pat Nos. 6,264,917, 
6,254,852, 6,245,318, and 6,139,819. MRI contrast agents, such as metal chelators, 
radionucleotides, paramagnetic ions, etc., combined with selective targeting agents are also 
5 described in the literature, e.g., in U.S. Pat Nos. 6,280,706 and 6,221,334. The methods 
described therein can be used generally to associate a partner with an agrat for any desired 
purpose. 

A transgenic animal with a dismpted melanocortin-lC receptor can have a 
pigmeatation phenotype, e.g., red or feir hair. Functional disruption of the gene can be 

10 accomplished in any effective way, including, e.g., introduction of a stop codon into any part 
of the coding sequence, e.g., to prevent expression of amino acids 367-398, such that the 
resulting polypeptide is biologically inactive or lacks one or more of its functional regions, 
introduction of a mutation into a promoter or other regulatory sequence that is eflfective to 
turn it off, or reduce transcription of the gene, insertion of an exogenous sequence into the 

1 5 gene which inactivates it (e.g., which disrupts the production of a biologically-active 

polypeptide or which disrupts the promoter or other transcriptional machinery), deletion of 
sequences from the gene, etc. A transgenic animal, or animal cell, lacking one or more 
functional genes of the present invention can be useful in a variety of applications, including, 
as an animal model for conditions and diseases associated with melanocortin-lC, for drug 

20 screening (e.g., by making a cell deficient in MCR-IC, the contribution of the activity 

remaining variants, such as MCR-IB and the 317-amino acid form, can be assessed), as a 
source of tissues deficient in one or more MCR-1 activities. The animal's endogenous locus 
can be replaced with a continuous coding sequence for MCR-IC, such that only MCR-IC is 
expressed, and no other form, such as MCR-IB and the 317-amino add form, are expressed. 

25 PGR based methods can also be used in the methods of detecting polynucleotides for 

human MCR-IC. Jn such methods, more than one probe specific for MCR-IC can be used, 
e.g., a pair of specific polynucleotide probes which are capable of amplifying a 
polynucleotide sequence of MCR-IC, such as corresponding to amino acids 1-366, 367-398, 
etc., of SEQ ID NO 26. For instance, SEQ ID NO 32 is in exon 1, SEQ ID NO 33 spans 

30 exons 2-3, and SEQ ID NO 34 is in exon 4. Thus, in a PCR reaction, SEQ EDS 32 and 33 
produce a firagment about 262 base pairs that is absent in MCR-1 A and MCR-IB. SEQ ID 
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NOS 32 and 34 in a PGR reaction produce a ftagment of about 61 5 base pairs which is absent 
from MCR-1 A and MCR-IB. 

Mutant alleles, polymorphisms, SNPs, etc., can be identified and isolated fix>m 
melanomas and other skin conditions that are known, or suspected to have, a genetic 

5 component Identification of such genes can be carried out routinely (see, above for more 
guidance), e.g., using PGR, hybridization techniques, direct sequencing, mismatch reactions 
(see, e.g., above), RFLP analysis, SSCP (e.g., Qrita et al., Proc. Natl Acad ScL, 86:2766, 
1992), etc., where a polynucleotide having a sequence selected fi^om SEQ ID NO 25 
(especially corresponding to amino acids 367-398) can be used as a probe. The selected 

1 0 mutant alleles, SNPs, polymorphisms, etc., can be used diagnostically to determine whether a 
subject has, or is susceptible to a melanoma or other condition (e.g., pigmentation variation, 
inflammatoiy condition) associated with a melanocortin receptor gene of the present 
invention, as well as to design therapies and predict the outcome of the disorder. 

The present invention an isolated polynucleotide comprising, a polynucleotide 

15 sequence coding without interruption for a human MCR-IC, or complement thereto, said 
MCR-IC having 84% or more amino acid sequence id^tity along its entire length to the 
sequence comprising amino acids 1-316 of SEQ ID NO 26, and 90% or more amino acid 
sequence identity along its entire l^gth to the sequence comprising from amino acids 317- 
398 of SEQ ID NO 26, and which has ligand-binding activity, G-protein binding activity, or 

20 cAMP production activity. 



OTB860 

OTB860 codes for an intracellular polypeptide comprising 1700 amino acids. Its 
nucleotide and amino acid sequences are shown in SEQ ID NOS 38 and 39. Expression of 
25 OTB860 is detected predominantly in heart and brain tissues, with minimally detectable 
levels in breast and testes tissues. The expression pattern is illustrated in Fig. 8. 

As shown in Fig. 9, OTB860 is related to KIAA1678 (also known as AB051465; 
SEQ ID NO 42). It contains 369 amino adds at its N-terminal end which are not present in 
KIAA1678, and a 29 amino acid insertion (1546-1574) at about amino acid position 1545. It 
30 also differs at amino add positions 847 (histidine instead of glutamine) and 867 (arginine 
instead of glutamine). 
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Consistent with fhe expression pattern of OTB860, there are a number of functional 
and developmental pathways that are shared between neuronal and cardiac cells. For 
example, neuregulins (neuregulin-1 — NRGl) play an important role in myocardial and 
neuronal development. Mice deficient in IgL-domain containing neureguUns have severe 
5 defects in the developing heart and nervous system. See, e.g., Kramer et al., Proc. Natl 

Acad, ScL, 93: 4833-4838, 1996; Zhao et al., J. Biol Chem., 273:10261-10269, 1998. These 
effects appear to be mediated by Type I neuregulins. Meyer et al.. Develop, 124:3575-3586, 
1997. The neuroregulin receptor, erbB4, is also highly expressed in heart and brain, although 
its expression is not restricted to these tissues (data not shown). Adhesion pathways are also 

1 0 shared between flie two tissues. For example, alpha4-integrin is expressed in both neuronal 
and cardiac cells. See, e.g., Pmco et al.,Mec/r. Dev., 100:99-103, 2001. 

OTB860 maps to chromosomal band 2q36. A number of disorders have been mapped 
to, or in close proximity to, this diromosome location. These include, e.g., brachydactyly 
type Al, pili torti and nerve deafiiess, syndactyly type 1, Gracile syndrome (growth 

15 retardation and early death), and epilepsy. Recently, in a whole genome scan, 2q36 was 
identified as a locus associated with acute coronary syndrome (Harrap et al,, Arterioscler. 
Thromb, Vase. Biol, 22:874-878, 2002), involving myocardial infarction, unstable angina, 
atherosclerotic plaque disruption, and coronary thrombosis. Nucleic adds of the present 
invention can be used as linkage markers, diagnostic targets, therapeutic targets, for any of 

20 the mentioned disorders, as well as any disorders or genes mappiag in proximity to it. 

OTB860 can be used in diagnostic, therapeutic, prophylactic, and research 
applications. RNA and polypeptide detection methods can be used to determine whether a 
sample comprises neuronal or cardiac tissues. When a positive is obtained, cell type markers 
can be used to determine precisely whether the tissue is neuronal or cardiac. For example, 

25 the presence or absence of a neuronal marker would distinguish between brain and heart 
tissues. Non-limiting examples of neuronal markers include, presenilins, genes and 
polypeptides in the pathways for neurotransmitter synthesis, recqptor, metabolism, etc., (e.g., 
serotonin, MAO, dopamine, norephinephrine, nitric oxide, etc.), apolipoprotein A, APP, 
neuron-specific enolase (NSE), glial fibrillary acidic protein (GFAP), SlOO, GAP-43, neuion- 

30 specific beta-m tubulin, Stac (neuron-specific protein with an SH3 domain, e.g.. Genomics, 
47:140-2, 1998), myelin basic protein, vimentin, etc. Non-limiting examples of heart tissue 
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markers include, cardiac troponin I, and smooth muscle markers such as CRP 1 . 

The present invention also relates to polypeptide detection mefliods for assessing 
heart or brain function, e.g., methods of assessmg heart or brain function, comprising, 
detecting OTB860 polypeptide, or fragments theareof, in a body fluid, whereby the level of 

5 OTB860 polypeptide in said fluid is a measure of heart or brain function. Heart or brain 
function tests are usually performed to determine whether the organ is functioning normally 
as a way of diagnosing disease. Various tests are used to test for heart function, such as 
electrocardiogram, stress test, echocardiogram, oxygen levels, and cardiac enzyme tests (e.g., 
creatine phosphokinase, troponin, lactate dehydrogenase, and myoglobin). Detection of 

10 OTB860 provides an additional assessment tool, especially in diseases such as myocardial 
infarction and other conditions, e.g., where cellular debris, etc., is released systemically. As 
with the other tests, elevated levels of OTB860 in blood, or other fluids, can indicate 
impaired brain or heart function. Values can be deteraiined routinely, as they are for other 
functional markers. 

1 5 OTB860 polynucleotides and polypeptides can be used to treat, prevent and diagnose 

diseases of the heart, including, e.g., acute coronary syndrome, myocardial hypertrophy, heart 
failure, conduction disordres, arrhythmias, bradyarrhythmias, sinus node dysfunction, 
tachyarrhythmias, tachycardias, atrial fibrillation, congenital heart diseases (see, e.g., 
Harrison's Principles of Internal Medicine, Volume 1, 12*^ Edition, 1991, Pages 924-925), 

20 atrial and ventricular septal defects, congenital aortic stenosis, coartation of the aorta, 
valvular heart disease, myocardial infarction, ischemic heart disease, cardiomyopathy, 
perocardial diseases, cardiac tumors (e.g., myxomei, Ipoma, papillary fibroelastoma, 
rhabdomyoma, sarcoma, etc), coronary artery disease, atherosclerosis, aortic aneurysm, etc. 
OTB860 polynucleotides and polypeptides can also be used to treat, prevent and 

25 diagnose diseases of the brain, including, e.g., vascular diseases, hypoxia, ischemia, 
infarction, tumors, neuroglial tumors, astrocytoma, glioblastoma multifome, piloqrtic 
astrocytoma, oUgodendrogUoma, ependymona, choriod plexus papilloma, neuronal tumors, 
neuroblastoma, gangUoneuroma, gangliocytoma, gangliogioma, primitive or undifferentiated 
tumors, medulloblastoma, tumors of meninges, mingioma, lymphomas, demyelinating 

30 diseases, multiple sclerosism perivenous encephalitis, degenerative diseases, Alzhefaner's, 
Pick's, Huntington's. Parkinsonism, ALS, Werdnig-Hoffinan, degenerative diseases of 
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cerebral cortex, ganglia, brainstem, and motor neurons, inborn errors of metabolism, 
demyelinating and dysmyelinating disorders, Pelizaeus-Merzbacher disease, multiple 
sclerosis, various leukodystrophies, post-traumatic demyelination, cerebrovascular (CVS) 
accidents, neuritis, nexiropathies, particularly, multifocal leucoencephalopathy, Guillain-Barre 

5 syndrome, retrobulbar neuritis, acute rubella encephalitis, chronic relapsing polyneuropathy, 
intravasciilar lymphomatosis, Krabbe disease, globoid cell leukodystrophy, subacute 
combined degeneration of the spinal cord and brain, allergic encephalitis, murine 
caronavirus, hepatitis virus infection, or Theiler's murine encephalomyelitis, prion diseases, 
Creutzfeldt- Jakob, especially, febrile familial convulsions, epilepsy, vascularneuromyopathy, 

1 0 cerebellar ataxia, etc. 

When expression is described as being '^predominantly" in a given tissue, this 
indicates that the gene's mRNAs levels are highest in this tissue as compared to the other 
tissues in which it was measured. Expression can also be "selective," where expression is 
observed. By the phrase "selectively expressed," it is meant that a nucleic acid molecule 

1 5 comprising flie defined sequence of nucleotides, when produced as a transcript, is 

characteristic of the tissue or cell-type in which it is made. This can mean that the transcript 
is expressed only in that tissue and in no other tissue-type, or it can mean that the transcript is 
expressed preferentially, differentially, and more abundantly (e.g., at least 5-fold, 10-fold, 
etc., or more) in that tissue when compared to other tissue-types. 

20 The present invention relates to methods of detecting brain or heart cells, comprising 

one or more of the following steps, e.g., contacting a sample comprising cells with a 
polynucleotide specific for OTB860 (e.g., SEQ ID NOS 40-41), or a mammalian homolog 
thereof, under conditions effective for said polynucleotide to hybridize specifically to said 
gene, and detecting specific hybridization. Detecting can be accomplished by any suitable 

25 method and technology, including, e.g., any of those mentioned and discussed below, such as 
Norttiem blot and PGR. Specific polynucleotides include SEQ ID NOS 40-41, and 
complements thereto. 

Detection can also be achieved using bindiag partners, such as antibodies (e.g., 
monoclonal or polyclonal antibodies) that specifically recognize polypeptides coded for by 

30 genes of the present invention. Thus, the present invention relates to methods of detecting a 
brain or heart cell, comprising, one or more the following steps, e.g. contacting a sample 
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comprising cells with a binding partner (e.g. an antibody, an Fab fragment, a single-chain 
antibody, an aptamer) specific for a polypeptide coded for by OTB860 (e.g., SEQ ID NO 39), 
or a mammalian homolog thereof imder conditions effective for said binding partner bind 
specifically to said polypeptide, and detecting specific binding. Protein binding assays can be 

5 accompUshed routinely, e.g., using immunocytochemistry, ELISA format. Western blots, etc. 
Usefiil epitopes include those exposed to the surface. 

A brain or heart cell (see above for examples of brain or heart cell types) can also be 
modulated in accordance with the present invention, e.g., by methods of modulating a brain 
or heart cell, comprising, e.g., contacting said cell with an agent effective to modulate 

10 OTB860, or tihie biological activity of a polypeptide encoded thereby (e,g., SEQ ID NO 39), 
or a maimnalian homolog thereof, whereby said brain or heart cell is modulated. Modulation 
as used throu^out includes, e.g., stimulating, increasing, agonizing, activating, amplifying, 
blocking, inhibiting, reducing, antagonizing, preventing, decreasing, diminishing, etc. Any 
activity or fimction of the brain or heart cell can be modulated, including, e.g., development, 

15 differentiation, signaling, excitability, etc. 

The present invention also relates to methods of modulating development of cardiac 
or neuronal cells, comprising, e.g., administering an agent which is effective for modulating 
the expression of OTB860, or the biological activity of a polypeptide encoded thereby, 
whereby the development of said cardiac or neuronal cell is modulated. Development is 

20 meant to include any process in which a cell or tissue matures, including differentiation, 
organogenesis, cell proliferation, cell survival, expression and induction of fimctional 
molecules, cell movement and migration, apoptosis, modulation of gene expression, 
trabeculation, etc. Examples of heart and brain development that can be modulation are 
disclosed in Kramer et al., Proc. Natl Acad. ScL, 93:4833-4838, 1996. Any agent can be 

25 used to modulate development in any environment, e.g., in situ, in vivo, or in vitro, 

OTB860 can be used to detect, modulate, eta, any of the cell types present in the 
heart or brain, including, but not limited to, heart cells comprising the coverings of the heart 
(e.g., pericardium, fibrous pericardium, serous pericardium containing the parietal layer and 
epicardium), heart wall comprising mycardium, cardiac muscle, and endocardium 

30 (endothelial), blood vessels, valves, and autorhythmic cardiac cells such as those in the SA 
and VA nodes, brain and other neuronal cells, such neurons, glia, microglia, ependymal cells. 
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oligodendrocyteSy Schwann cells, and satellite cells. 

Promoter sequences obtained from OTB860 can be utilized to selectively express 
heterologous genes in brain or heart cells- Mediods of expressing a hetocologpus 
polynucleotide in brain or heart cells can comprise, e.g., expressing a nucleic acid constract 
5 in brain or heart cells, said construct comprising a promoter sequence operably linked to said 
heterologous polynucleotide, wherein said promoter sequence is obtained from OTB860, e.g., 
on genomic NT_022115.8. In addition to the cell lines mentioned below, the constmct can 
be expressed in primary cells or in established cell Unes. 

The present invention also relates to methods of modulating development of cardiac 
10 or neuronal cells, comprising, e.g., administering an agent which is effective for modulating 
the expression of OTB60, or the biological activity of a polypeptide encoded thereby, 
whereby the development of said cardiac or neuronal cell is modulated. 

The present invention also relates to a mammalian cell whose genome comprises a 
ftmctional disruption of the human OTB860 gene vnibm a polynucleotide sequence coding 
15 for amino add residues 1-369 (SEQ ID NO 39) or 1546-1574 (SEQ ID NO 39). A non- 
human, transgenic mammal comprising such a cell can have a heart or neuronal tissue defect. 

Antibodies can be produced, e.g., an antibody which is specific-for: an epitope 
comprising amino acid 847, amino acid 867, or an epitope contained with amino acids 1-369 
(SEQ ID NO 39), or amino acids 1546-1574 (SEQ ID NO 39). 

20 

TARRP 

Human TARPP (thymocyte cyclic AMP regulated phosphoprotein, or, Brl37A, B, C, 
D, and E) is represented by a family of altemative splice variants. Figs. 10-12 summarize the 
differences between the multiple forms. Brl37E is an 847 amino acid polypeptide. Its 

25 nucleotide and amino add sequraces are shown in SEQ ID NOS 43 and 44. Brl 37B (SEQ 
ID NO 47 and 48) has a deletion of amino acids 267-300, Brl37A (SEQ ID NO 45 and 46) 
has a deletion of amino acids 312-331, and Brl37C (SEQ ID NO 49 and 50) has a deletion of 
both these domains. Brl37D (SEQ ID NO 51 and 52) contains only the first 87 amino adds 
followed by a two-amino acid N-terminus which differs fix)m flie other forms. A partial 

30 clone, AL133109(SEQIDNO55)asshowninFig, 10, is missing the first 161 aminoadds 
of Brl37E, as well as having an amino acid difference at position 312 (SEQ ID NO 44). 
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Brl37E contains a nuclear localization signal at about amino adds 107-124, an R3H 
domain (single-stranded nucleic acid binding domain) at about amino adds 147-224, and a 
proline rich region at about amino adds 476-682. These domains are also present in the A-C 
splice forms, but at different amino add positions. Human TARPP has nucldc add binding 

5 activity conferred by the corresponding binding domain indicating that it can bind nucldc 
adds, preferably single-stranded DNA or RNA. This binding activity can be assayed 
routinely, e.g., using gel electrophoresis band shift assays, e.g., as carried out in, e.g., U.S. 
Pat No. 6,333,407 and 5,789,538, ELISA-based assays (e.g.. Mercury™ TransFactor Kit 
from Clontech), and other assays which detect DNA-protein interactions. 

10 The Brl37 family represent the human homologs of murine TARPP (thymocyte 

ARPP) (NM_033264; SEQ ID NO 53; "Mouse" in Fig. 12). Brl37E has about 83% amino 
acid identity and 87% homology with it (calculated using the BLAST algorithm). See, Fig. 
12 (NM_033264 is murine TARPP). In addition to amino add sequence diflferences 
between ttie two proteins, human TARPP has an insertion at about amino acid positions 549- 

1 5 572 of SEQ ID NO 44 which is not present in the mouse protein. See, Fig. 12. 

Originally, a 21 kDa polypeptide was isolated from rat basal ganglia based on its 
phosphorylation by cAMP-dq)endent protein kinase (PKA). Williams et al., /. NeuroscL, 
9:3631-3637,1989. It was named ARPP-21 (cAMP-regulatedphosphoprotein). Activation 
of dopamine receptors resulted in an increase in the phosphorylation of ARPP-21 , Caporaso 

20 etal.,iVei/rop/w?r/n., 39:1637-1644,2000. Human ARPP-21 (Brl37D) contains 89 amino 
acids (NM__016300; SEQ ID NO 52). 

A high molecular weight polypeptide of ARPP-21 was subsequently identified in T- 
cells and named TARPP. Kisielow et al., £wr J. //wwuwo/., 31:1141-1149, 2001. This 
polypeptide contains ARPP-21 sequence at its 5' end, but a novel 3' end coding for more 

25 than 700 additional amino acids (for a total of 807 amino acids). Murine TARPP appears to 
be involved in the regulation of thymocyte maturation and TCR rearrangement Expression 
of TARPP is down-regulated after the TCR signals delivered. It is highly expressed in 
immature thymocytes and is associated with the commitment to the T-cell lineage, making it 
highly selective marker for T-cell commitment See, Kisielow, ibid. After commitment to 

30 the T-cell hneage during positive selection, its expression is turned off. 

There appear to be several members of the human TARPP family. KIAA0029 is a 
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hypoihetical protein that shares about 45% amino add sequence identity and 59% homology 

with Brl37E. KIAA1002, a second hypothetical protein, has about 42% amino add identity 

and 54% homology to it. 

Human TARPP is highly expressed in brain, pituitary, muscle, and thymus. It is 
5 expressed at lower levels in adrenal gland, bone marrow, heart, small intestine, kidney, liver, 

ovary, prostate, stomach, testis, and tiiyroid. There was virtually no detectable expression in 

colon, lung, lymph node, peripheral lymphocytes, mammary gland, pancreas, and uterus. 

As indicated by its expression pattem, human TARPP is involved the maturation of 

T-cells, especially in the rearrangement of the TCR. For this reason, it can be used to 
10 modulate T-cells, e.g., in allergy, autoimmune disease (e.g., rheumatoid arthritis and multiple 

scleiosis), and graft-host disease. It can also be used a marker to determine the index of 

mature versus immature T-cells, where human TARPP is marker of immature T-cells. 

Additionally, human TARPP is phosphorylated upon dopamine receptor activation, 

indicating an involvement in dopamine pathways. Consequently, it is target for diseases that 
15 involve dopamine, including, e,g., schizophrenia, substance abuse and addiction, anxiety, 

Parkinson's disease, and other dopaminergic diseases and conditions. 

Human TARPP is localized to chromosomal band 3p21.33. There are several 

disorders genetically mapped to this region, including, e.g., retinal vasculopathy with cerebral 

leukodystrophy (OMIM 192315), deafiiess, neurosensory, autosomal recessive 6 (OMIM 
20 600971), and lung cancer. Nucleic acids of the present invention can be used as liiikage 

markers, diagnostic targets, therapeutic targets, for any of the mentioned disorders, as well as 

any disorders or genes mapping in proximity to it. 

Diseases or disorders which can be treated in accordance with the present invention 

include, but are not limited to autoimmune disease, such as multiple sclerosis and rheumatoid 
25 arthritis, and allergy 

The gene can be disrupted in a spedfic region, e.g., in the sequence coding for amino 

adds 1 -1 6 1 of a human TARPP. Cells and/or animals can also have targeted deletions, e.g., 

deletion of a coding sequence for amino adds 267-300 and/or 312-331 of a human TARPP. 

One or more the different splice forms, Brl37A-E can also be knocked-out or disrupted, e.g., 
30 to dissect out the individual activities. 
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The present invention relates to methods of modulating T-cells, comprising, 
contacttQg T-cells with an agent which is effective for regulating a human TARPP gene 
expressed in said cells, or for modulating the biological activity of a polypeptide encoded 
thereby. 

5 Included also in the present invention are engineered cells, e.g., a human cell whose 

genome comprises a functional disruption of human TARPP in the region comprising the 
coding sequence for amino acids 1- 161 of a human TARPP of SEQ ID NO 44, or a human 
ceU whose genome comprises a deletion of a coding sequence for amino acids 267-300 
and/or 312-331 of a human TARPP of SEQ ID NO 44. 

1 0 Antibodies can be produced, e.g., an antibody which is specific-for a human TARPP, 

said antibody which is specific for an epitope present in amino acid sequences 1-161, 88-161, 
267-300, 3 12-33 1, or a polypeptide comprising amino acid 3 12, of a human TARPP of SEQ 
ID NO 44. 

15 LAT-1 

Liver-associated transmembrane protein-1 CTLAT-1" or TMD008) codes for a 
polypeptide comprising 276 amino adds. Its expression is highly restricted to the liver, i.e., 
it is predominantly expressed in the liver. The nucleotide and amino acid sequences of it are 
shown in SEQ ID NOS 58 and 59. It contains transmembrane domains at about amino acid 

20 positions 24-46, 59-81, 101-123, 144-166, 203-225, and 237-259. It is homologous to the 
olfactory class of GPCR receptors. LAT-1 is also known as XM_060456 and AX242289. 

The gene for LAT-1 maps to chromosomal band lq22. Several different disorders 
map to this location, including, e.g., porphyria variegata, progression of lymphoma, 
Zellweger syndrome, Charcot-Marie-Tooth neuropathy- IB, congenital hypomyelination, 

25 nemaline myopathy, and CDS zeta chain deficiency, medullary thyroid carcinoma, 

susceptibility to Vivax malaria, schizophrenia susceptability locus, autosomal dominant 
deafiiess, susceptibility to Lupus nephritis, familial hemiplegic migraine, apolipoprotein A-E 
deficiency, and familial hyperlipidemia. Nucleic adds of the present invention can be used 
as linkage markers, diagnostic targets, therapeutic targets, for any of the mentioned disorders, 

30 as well as any disorders or genes mapping in proximity to it. 

LAT-1 can be used as a diagnostic and prognostic marker for liver fimction and 
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disease, including any of the liver diseases already mentioned For instance, blood serum 
levels of LAT-1 (as well as other bodily fluids) can be used as an indicator of liver disease, 
especially those diseases characterized by necrotic and degenerative lesions, such as hepatitis, 
toxicity, and cirrhosis. Any condition which results in degeneration of the liver can result in 

5 the appearance of higher than normal amounts of blood serum LAT-1 . LAT-1 can be used 
alone, or in combination with other molecular markers for liver function, such as bilirubin, 
serum aminotransferases (e.g., AST and ALT), alkaline phosphatase, gamma- 
glutamyltranspeptidase (GGT), albumin, globulin, and blood anamonia. 

Because of the selectivity of LAT-1 for the liver, it is a useful target for both 

1 0 histological and therapeutic applications. Antibodies and other LAT- 1 binding partners can 
be used to selectively target agents to liver tissue for any purpose, included, but not limited 
to, imaging, therapeutic, diagnostic, drug delivery, gene ttiempy, etc. For example, LAT-1 
binding partners, such as antibodies, can be used to treat liver carcinoma, in analogy to how 
c-erbB-2 antibodies are used to breast cancer, to detect metastatic liver cells, etc. Useful 

15 antibodies or other binding partners include those that are specific for parts of LAT-1 which 
are exposed extracellularly, e.g. , amino acids 1-23, 82-100, 167-202, etc. 

Imaging of specific organs can be facilitated using agents, such as LAT-1, tiiat can be 
used to selectively target contrast agents to a specific site in the body. Various imaging 
techniques have been used in this context, including, e.g.. X-ray, CT, CAT, MRI, ultrasound, 

20 PET, SPECT, and scintographic. A reporter agent can be conjugated or associated routinely 
with a LAT-1 binding partner. Ultrasound contrast agents combined with binding partners, 
such as antibodies, are described in, e.g., U.S. Pat. Nos, 6,264,917, 6,254,852, 6,245,318, and 
6,139,819. MRI contrast agents, such as metal chelators, radionucleotides, paramagnetic 
ions, etc., combined with selective targeting agents are also described in the literature, e-g., in 

25 U.S. Pat Nos. 6,280,706 and 6,221,334. The methods described therein can be used 
generally to associate a LAT-1 binding partner with an agent for any desired purpose. 

LAT-1 binding partaers can also be used as to specifically deliver therapeutic agents 
to the liver. For example, hypercholesterolemia and other metabolic diseases can be treated 
by gene therapy, using the LAT-1 to specifically deliver the LDL receptor to the liver. The 

30 gene can be conjugated to a LAT-1 bindmg partner (directly or through a polymer, etc.), in 
liposomes comprismg cell surface. Additionally, cytotoxic, cytostatic, and otiier therapeutic 
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agents can be delivered to flie liver via LAT-1 to treat and/or prevent any of the above- 
mentioned conditions associated with liver disease, e.g., cardnoma. 

The liver is the largest and most metabolically complex organ in ttie body. Its 
fimctions include, e.g., storage of iron, production of bile to facilitate digestion, 
detoxifications of various exogenous chemicals, including alcohol and many drugs, energy 
stoclq)iling (carbohydrates and fat), production of clotting factors, and manufacture of blood. 
There are a number of diseases which affect the liver, iacluding, Alagille syndrome, 
alcoholic liver disease, alpha- 1 -antitrypsin deficiency, autoimmune hepatitis, Budd-Chiari 
syndrome, biliary atresia, Byler disease, liver cancer, Caroli disease, cirrhosis, Crigler-Najjar 
syndrome, Dubin- Johnson syndrome, fetty liver, galactosemia, Gilbert syndrome, glycogen 
storage disease, hemangioma, hemochromatosis, hepatitis A-G, porphyria cutanea tarda, 
primary biliary cirrhosis, protoporphyria, erythrohepatic. Rotor syndrome, sclerosing 
cholangitis, and Wilson disease. Liver disease is of grave concern around the world. 

The liver is divided into many small units, known as lobules. The lobule is the 
structural unit of the Uver. Each lobule is comprised of radial plates of liver cells, called 
hepatocytes, and is surroimded by a connective sheath. A central vein ("CV") is located in 
the middle, and there are portal triads at flie vertices. Each triad comprises a branch of the 
hepatic artery (supplying arterial blood to the lobule), a branch of the hepatic portal vein 
(carrying nutrient-rich blood fi-om the digestive viscera), and a bile duct. The blood fix)m the 
artery and portal vein flow into leaky capillaries, the liver sinusoids, which are located 
between the hepatic plates of the lobule. 

The acinus is the functional unit of the liver. While the boundaries of the lobule are 
well visible, those ofthe acinus are uiirecognizable under the inicroscope. Arising like a 
berry, a grape (latin "acinus") on the vine aroimd the portal triad, the Uver acinus is formed of 
a mass of liver cells and sinusoids which drain toward two adjacent central veins. The 
principal metabolic functions of the liver are performed by hepatocytes. These functions 
include, e.g., formation and excretion of bile, regulation of carbohydrate homeostasis, Upid 
synthesis and secretion of plasma lipoproteins, regulation of cholestm)! metaboUsm, 
formation of urea, serum albumin, clotting factors, enzymes, and numerous oiher proteins; 
and metaboUsm or detoxification of drags and other foreign substances. Hepatocytes in 
different regions of the acinus perform diflEerent functions, e.g., ^uconeogenesis is primarily 
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a function of tiie zone of cells closest to flie triad, whereas glycolysis mainly occurs in flie 
tardiest zone from it. 

A promoter obtained from the LAT-1 can be used, e.g., in gene therapy to obtain 
tissue-specific expression of a heterologous gene (e.g., coding for a therapeutic product or 

5 cytotoxin). A promoter sequence is foxmd at about nucleotide positions 1 164-12 1 2 of SEQ 
ID NO 58 and can be used (e.g., 1 164 to the first ATG codon) to drive Uver-specific 
expression of a heterologous sequence. 5' and 3' sequences (including, UTRs and introns) 
can be used to modulate or regulate stability, transcription, and translation of nucleic adds, 
including the sequence to which is attached m nature, as well as heterologous nucleic acids. 

10 A polyadenyiation site is found at about nucleotide positions 4265-4275 of SEQ ID NO 58. 
The upstream 3'UTR can be used as described above. UseM polypeptides include 
polypeptides e3q>osed extracellularly, e.g. , amino adds 1-23, 82-100, 167-202, of SEQ ID 
NO 59, etc. 

The present invention also relates to mettiods of detecting human liver tissue in a 

15 sample, e.g., comprising tissue, cells, or other cellular materials or debris, comprising one or 
more of the following steps, e.g., contacting said sample with a binding partner specific for 
human LAT-1 xmder conditions effective for said binding partner to bmd specifically to 
human LAT-1, and detecting spedfic binding between said binding partner and said human 
LAT-1 , whereby specific binding indicates that liver tissue is present in said sample. 

20 The sample can be contacted with the binding partner in any manner which is 

effective to give the binding partner access to the material present in the tissue sample. How 
contact is achieved can depend on flie format of the detection assay. For instance, if a ELIS A 
assay is used, and the binding partner is an antibody on a solid phase in a well, then placing 
an aqueous sample in the well would achieve contact between partner and sample. Any type 

25 of sample can be used, including, e.g., blood (whole blood, fiactionated blood, serum, etc.), 
stool, urine, cerebral spinal fluid, tissue biopsy, etc. 

The binding partner, such as a monoclonal or polyclonal antibody, is specific for 
LAT-1, and is contacted with the sample under conditions effective for said binding partner 
to bind specifically to human LAT-1, if human LAT-1 is present in the sample. Specific 

30 binding, as previously discussed for polynucleotides, indicates that the binding partner binds 
or attaches to its target polypeptide without significant binding to otiier polypeptides C*non- 
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specific binding'') in ttie sample. Tliis concept is well known in the art The detection of 
specific binding can be accomplished by any of the aforementioned assays. 

The present invention also relates to polypeptide detection mefliods for assessing hver 
function, e.g., methods of assessing liver function, comprising, detecting LAT-1 polypeptide, 

5 or fragments thereof, in a body fluid, whereby the level of LAT-1 polypeptide in said flnid is 
a measure of Uver fimction. Liver function tests are usually perfomied to detemiine whethCT 
the Uver is functioning normally as a way of diagnosing liver disease. Various tests are 
commonly vised, including, e.g., alkaline phosphatase, alanine transferase, aspartate 
transferase, bilimbin, gamma-glutamyl transpeptidase, lactic dehydrogenase, 5 '-nucleotidase, 

10 albumin, alpha-fetoprotein, mitochondrial antibodies, and prothrombin time. See, e.g., 

Harrison's I^nciples of Internal Medicine, Yoh^^ 1308-1317, 12*^ Edition, 199L 

Detection of LAT-1 provides an additional assessment tool, especially in diseases such as 
hepatitis, carcinoma, liver toxicity, cirrhosis, and other liver conditions, e.g., where cellular 
debris, etc., is released systemically. As with the other tests, elevated levels of LAT-1 in 

1 5 blood, or other fluids, can indicate impaired liver function. Values can be determined 
routinely, as they are for other liver function markers. 

Nucleic acids 

A maromalian polynucleotide, or fragment thereof, of the present invention is a 
20 polynucleotide having a nucleotide sequence obtainable from a natural source. When the 
species name is used, e.g., a human, it indicates that the polynucleotide or polypeptide is 
obtainable from a natural source. It therefore includes naturally-occurring normal, naturally- 
occurring mutant, and naturally-occurring polymorphic alleles (e.g., SNPs), differentially- 
spliced transcripts, splice-variants, etc. By the term ^^naturally-occurring," it is meant that the 
25 polynucleotide is obtainable from a natural source, e.g., animal tissue and cells, body fluids, 
tissue culture cells, forensic samples. Natural somrces include, e.g,, living cells obtained from 
tissues and whole organisms, tumors, cultured cell lines, including primary and inamortalized 
cell lines. Naturally-occurring mutations can include deletions (e.g., a truncated amino- or 
carboxy-terminus), substitutions, inversions, or additions of nucleotide sequence. These 
30 genes can be detected and isolated by polynucleotide hybridization according to methods 
which one skilled in tiie art would know, e.g., as discussed below. 
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A polynucleotide according to the present invention can be obtained from a variety of 
different sources. It can be obtained from DNA or RNA, such as polyadenjdated mRNA or 
total RNA, e.g., isolated from tissues, cells, or whole organism. The polynucleotide can be 
obtained directly from DNA or RNA, from a cDNA Ubrary, from a genomic library, etc. The 

5 polynucleotide can be obtained from a cell or tissue (e.g., from an embryonic or adult tissues) 
at a particular stage of development, having a desired genotype, phenotype, disease status, 
etc. A polynucleotide which "codes without interruption" refers to a polynucleotide having a 
continuous open reading frame ("ORF") as compared to an ORF which is interrupted by 
introns or other noncoding sequmces. 

10 Polynucleotides and polypeptides (including any part of a differentially regulated 

cancer gene) can be excluded as compositions from the present invention if, e.g., listed in a 
publicly available databases on the day fliis application was filed and/or disclosed in a patent 
appUcation having an earlier filing or priority date than this application and/or conceived 
and/or reduced to practice earlier than a polynucleotide in this application. 

1 5 As described herem, the phrase "an isolated polynucleotide which is SEQ ID NO," or 

"an isolated polynucleotide which is selected from SEQ ID NO,'* refers to an isolated nucleic 
add molecule from which ttie recited sequence was derived (e.g., a cDNA derived firom 
mRNA; cDNA derived from genomic DNA). Because of sequencing errors, typographical 
errors, etc., the actual naturally-occurring sequence may differ from a SEQ ID listed herein. 

20 Thus, the phrase indicates the specific molecule from which the sequence was derived, rather 
than a molecule having that exact recited nucleotide sequence, analogously to how a culture 
depository number refers to a specific cloned fragment in a cryotube. 

As explained in more detail below, a polynucleotide sequence of the invention can 
contain the complete sequence as shown in the corresponding SEQ ID, degenerate sequences 

25 thereof, anti-sense, muteins thereof genes comprising said sequences, fiiU-length cDNAs 
comprising said sequences, complete genomic sequences, fragments thereof homologs, 
primers, nucleic acid molecules which hybridize thereto, derivatives thereof, etc. 

Genomic 

30 The present invention also relates genomic DNA from which the polynucleotides of 

the present invention can be derived. A genomic DNA coding for a human, mouse, or other 
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mammalian polymicleotide, can be obtained routinely, for example, by screening a genomic 
library (e.g., a YAC library) with a polynucleotide of the present invention, or by searching 
nucleotide databases, such as GraBank and EMBL, for matdies. Promoter and other 
regulatory regions (including both 5* and 3' regions, as well introns) can be identified 

5 upstream or downstream of coding and expressed RNAs, and assayed routinely for activity, 
e.g., by joining to a reporter gene (e.g., CAT, GFP, alkaline phosphatase, luciferase, 
galatosidase). A promoter obtained from a gene can be used, e.g., in gene therapy to obtain 
tissue-specific expression of a heterologous gene (e.g., coding for a therapeutic product or 
cytotoxin). 5' and 3' sequences (including, UTRs and introns) can be used to modulate or 

1 0 regulate stability, transaription, and translation of nucleic acids, including the sequence to 
which is attached in nature, as well as heterologous nucleic acids. 



Constructs 

A polynucleotide of the present invention can comprise additional polynucleotide 

1 5 sequences, e.g., sequences to enhance expression, detection, uptake, cataloging, tagging, etc. 
A polynucleotide can include only coding sequence; a coding sequence and additional non- 
naturally occurring or heterologous coding sequence (e.g., sequences coding for leader, 
signal, secretory, targeting, enzymatic, fluorescent, antibiotic resistance, and other fimctional 
or diagnostic peptides); coding sequences and non-coding sequences, e.g., untranslated 

20 sequences at either a 5* or 3' end, or dispersed in the coding sequence, e.g., introns. 

A polynucleotide according to the present invention also can comprise an expression 
control sequence operably linked to a polynucleotide as described above. The phrase 
"expression control sequence" means a polynucleotide sequence that regulates expression of 
a polypeptide coded for by a polynucleotide to which it is functionally ("operably") linked^ 

25 Expression can be regulated at the level of the mRNA or polypeptide. Thus, the expression 
control sequence includes mRNA-related elements and protein-related elements. Such 
elements include promoters, enhancers (viral or cellular), ribosome binding sequences, 
transcriptional terminators, etc. An expression control sequence is operably linked to a 
nucleotide coding sequence when the expression control sequence is positioned in such a 

30 manner to effect or achieve expression of the coding sequence. For example, when a 

promoter is operably linked 5' to a coding sequence, expression of the coding sequence is 
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driven by the promoter. Expression control sequences can include an initiation codon and 
additional nucleotides to place a partial nucleotide sequence of the present invmtion in-frame 
in order to produce a polypeptide (e.g., pET vectors from Promega have been designed to 
permit a molecule to be inserted into all three reading fimnes to identify the one that results 
5 in polypeptide expression). Expression control sequences can be heterologous or endogenous 
to the normal gene. 

A polynucleotide of the present inv^ation can also comprise nucleic acid vector 
sequences, e.g., for cloning, expression, amplification, selection, etc. Any effective vector 
can be used. A vector is, e.g., a polynucleotide molecule which can replicate autonomously 

10 in a host cell, e.g,, containing an origin of replication. Vectors can be usefiil to perform 

manipulations, to propagate, and/or obtain large quantities of the recombinant molecule in a 
desired host A skilled worker can select a vector depending on the purpose desired, e.g., to 
propagate the recombinant molecule in bacteria, yeast, insect, or mammalian cells. The 
following vectors are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 

15 (Qiagen), pBS, pDlO, Phagescript, phiX174, pBKPhagemid, pNH8A, pNH16a, pNHlSZ, 
pNH46A (Stratagene); Bluescript KS+H (Stratagene); ptrc99a, pKK223-3, pKK233-3, 
pDR54 0, pRTTS (Pharmacia). Eukaryotic: PWLNEO, pSV2CAT, pOG44, pXTl, pSG 
(Stratagene), pSVK3, PBPV, PMSG, pSVL (Pharmacia), pCR2.1/TOPO, pCRD/TOPO, 
pCR4/TOPO, pTrcHisB, pCMV6-XL4, etc. However, any other vector, e.g., plasmids, 

20 viruses, or parts thereof, may be used as long as they are replicable and viable in the desired 
host. The vector can also comprise sequences which enable it to replicate in the host whose 
genome is to be modified. 

Hybridization 

25 Polynucleotide hybridization, as discussed in more detail below, is usefid in a variety 

of applications, including, in gene detection methods, for identifying mutations, for making 
mutations, to identify homologs in the same and different species, to identify related 
members of the same gene family, in diagnostic and prognostic assays, in therapeutic 
applications (e.g., where an antisense polynucleotide is used to inhibit expression), etc. 

30 The ability of two single-stranded polynucleotide preparations to hybridize together is 

a measure of their nucleotide sequence complementarity, e.g., base-pairing between 
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nucleotides, such as A-T, G-C, etc. The invention thus also relates to ^lynucleotides, and 
their complements, which hybridize to a polynucleotide comprising a nucleotide sequence as 
set forth in tiie sequences disclosed herein, and genomic sequences thereof. A nucleotide 
sequence hybridizing to the latter sequence will have a complementary polynucleotide strand, 
or act as a template for one in flie presence of a polymerase (i.e., an appropriate 
polynucleotide synthesizing enzyme). The present invention includes both strands of 
polynucleotide, e.g., a sense strand and an anti-sense strand. 

Hybridization conditions can be chosen to select polynucleotides which have a 
desired amount of nucleotide complementarity with the nucleotide sequaices set forth in 
SEQ ID NOS 1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58and genomic sequences thereof. 
A polynucleotide capable of hybridizing to such sequence, preferably, possesses, e.g., about 
70%, 75%, 80%, 85%, 87%, 90%, 92%. 95%, 97%, 99%. or 100% complementarity, 
between the sequences. The preset invention paiticulariy relates to polynucleotide 
sequences which hybridize to the nucleotide sequences set forth in the sequence disclosure 
herein or genomic sequences thereof, under low or high stringency conditions. These 
conditions can be used, e.g., to select corresponding homologs in non-human species. 

Polynucleotides which hybridize to polynucleotides of the present invention can be 
selected in various ways. Filter-type blots (i.e., matrices containing polynucleotide, such as 
nitrocellulose), glass chips, and other matrices and substrates comprising polynucleotides 
(short or long) of interest, can be incubated in a prehybridization solution (e.g., 6X SSC, 
0.5% SDS, 100 ng/ml denatured salmon sperm DNA, 5X Denhardt's solution, and 50% 
foimamide), at 22-68°C, overnight, and then hybridized with a detectable polynucleotide 
probe under conditions appropriate to achieve the desired stringenqr. In general, when higji 
homology or sequence identity is desired, a high temperature can be used (e.g., 65 °C). As 
the homology drops, lower washing temperatures are used. For salt concentrations, the lower 
the salt concentration, the hi^er the stringency. The length of the probe is anottier 
consideration. Very short probes (e.g., less than 100 base pairs) are washed at lower 
temperatures, even if the homology is higjL With short probes, formamide can be omitted. 
See, e.g., Current Protocols in Molecular Biology, Chapter 6, Screening of Recombinant 
Libraries; Sambrook et al.. Molecular Cloning, 1989, Chapter 9. 
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For instance, high stringencqr conditions can be achieved by incubating the blot 
ovemi^t (e.g., at least 12 hovirs) with a polynucleotide probe in a hybridization solution 
containing, e.g., about 5X SSC, 0.5% SDS, 100 ng/ml denatured salmon sperm DNA and 
50% formamide, at 42°C, or hybridizing at 42''C in 5X SSPE, 0.5% SDS, and 50% 
formamide, 100 ^ig/ml denatured sahnon sperm DNA, and washing at 65°C in O.IX SSC and 
0.1% SDS. 

Blots can be washed at high stringency conditions that allow, e.g., for less than 5% 
bp mismatch (e.g., wash twice in 0.1% SSC and 0.1% SDS for 30 min at 65°C), Le., 
selecting sequences having 95% or greater sequence idraitity. 

Other non-limiting examples of high stringency conditions includes a final wash at 
65*'C in aqueous buffer containing 30 mM NaCl and 0.5% SDS. Another example of high 
stringent conditions is hybridization in 7% SDS, 0.5 M NaP04, pH 7. 1 mM EDTA at 50»C, 
e.g., overnight, followed by one or more washes with a 1% SDS solution at 42°C. 

Whereas high stringency washes can allow for, e.g., less than 10%, less than 5% 
mismatch, etc., reduced or low stringency conditions can permit up to 20% nucleotide 
mismatch. Hybridization at low stringency can be accompUshed as above, but using lower 
formamide conditions, lower temperatures and/or lower salt concentrations, as well as longer 
periods of incubation time. 

Hybridization can also be based on a calculation of melting temperature (Tm) of the 
hybrid formed betweoi the probe and its target, as described in Sambrook et al.. Generally, 
the temperature Tm at which a short oligonucleotide (containing 18 nucleotides or fewer) 
will melt fiom its target sequence is given by the following equation: Tm = (number of A's 
and T's)x2''C + (number of C's and G's)x4''C. For longer molecules, Tm = 81.5 + 16.6 
logioD^a^ + 0.41 (%GC) - 600/N where [Na"^ is the molar concentration of sodium ions, 
%GC is the percraitage of GC base pairs in Ihe probe, and N is flie length. Hybridization can 
be carried out at several degrees below this temperature to ensure that the probe and target 
can hybridize. Mismatches can be allowed for by lowering the temperature even finther. 

Stringent conditions can be selected to isolate sequences, and their complements, 
which have, e.g., at least about 90%, 95%, or 97%, nucleotide complementarity between the 
and a target polynucleotide. 
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Other homologs of polynucleotides of the present invention can be obtained from 
mammalian and non-mammalian sources according to various methods. For example, 
hybridization with a polynucleotide can be employed to select homologs, e.g., as described in 
Sambrook et al.. Molecular Cloning, Chapter 11, 1989. Such homologs can have varying 

5 amounts of nucleotide and amino acid sequence identity and similarity to such 

polynucleotides of the present invention. MammaUan organisms include, e.g., mice, rats, 
monkeys, pigs, cows, etc. Non-mammalian organisms include, e.g., vertebrates, 
invertebrates, zebra fish, chicken, Drosophila, C. elegans, Xenopus, yeast such as S. pombe, 
S. cerevisiae, roundworms, prokaryotes, plants, Arabidopsis, artemia, viruses, etc. The 

10 degree of nucleotide sequence identity between human and mouse can be about, e.g. 70% or 
more, 85% or more for open reading frames, etc. 

Alignment 

Alignments can be accomplished by using any effective algorithm. For pairwise 

1 5 aligranents of DNA sequences, the methods described by Wilbur-Lipman (e.g., Wilbur and 
Lipman, Proc. Natl Acad, ScL, 80:726-730, 1983) or Martinez/Needleman-Wunsch (e.g., 
Martm&z, Nucleic Acid Res., 11:4629-4634, 1983) can be used. For instance, if flie 
Martinez/Needleman-Wunsch DNA alignment is applied, the min i mum match can be set at 
9, gap penalty at 1 .10, and gap length penalty at 0.33. The resxilts can be calculated as a 

20 similarity index, equal to the sum of the matching residues divided by the sum of all residues 
and gap characters, and then multiplied by 100 to express as a percent. Similarity index for 
related genes at the nucleotide level in accordance with the present invention can be greater 
than 70%, 80%, 85%, 90%, 95%, 99%, or more. Pairs of protein sequences can be aligned 
by the Lipman-Pearson method (e.g., Lipman and Pearson, Science, 227:1435-1441, 1985) 

25 with k-tople set at 2, gap penalty set at 4, and g^ length penalty set at 12. Results can be 
expressed as pa-cent similarity index, where related genes at the amino add level in 
accordance with the preset invention can be greater than 65%, 70%, 75%, 80%, 85%, 90%, 
95%, 99%, or more. Varioiis commercial and free sources of aligmnent programs are 
available, e.g., MegAlign by DNA Star, BLAST (National Center for Biotechnology 

30 Information), BCM (Baylor College of Medicine) Launcher, eta BLAST can be used to 
calculate amino acid sequence identity, amino add sequoice homology, and nucleotide 
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sequence identity. These calculations can be made along the entire length of each of the 
target sequences which are to be compared. 

After two sequences have been aligned, a '"percent sequence identity*' can be 
detennined. For these purposes, it is convenient to refer to a Reference Sequmce and a 

5 Compared Sequence, where the Compared Sequence is compared to the Reference Sequoice. 
Percent sequence identity can be detennined according to the following formula: Percent 
Idaitity =100 [HC/R)], wherein C is the numba: of differences between die Reference 
Sequence and the Compared Sequence over the length of aUgnment between the Reference 
Sequence and the Compared Sequence where (i) each base or anaino acid in the Reference 

10 Sequence that does not have a corresponding aligned base or amino add in the Compared 

Sequence, (ii) each gap in tiie Reference Sequence, (iii) each aligned base or amino add in the 
Reference Sequence that is diflferent ftom an aligned base or amino add in the Compared 
Sequence, constitutes a diffeience; and R is the number of bases or amino acids in the 
RefCTence Sequence over the length of the alignment with the Compared Sequence witii any 

1 5 gap created in the Reference Sequence also being counted as a base or amino add. When it is 
stated that a polynucleotide sequence has a certain percentage ving 95% or more sequence 
identity along the entire length of the polynucleotide sequmce set forth in SEQ ID NO 1, 11, 
16, 25, 38, 43, 45, 47, 49, 51, or 58, it means that when the polynucleotide is shorter than the 
mentioned SEQ ID NOS, the missing bases are comted for the purposes of the calculation. 

20 Percent sequence identity can also be determined by other conventional methods, e.g., as 
described in Altschul et al.. Bull Math. Bio. 48: 603-616, 1986 and Henikoff and Henikoff, 
Proc. Natl Acad ScL USA 89:10915-10919, 1992. 

Specific polynucleotide probes 

25 A polynucleotide of the present invention can comprise any continuous nucleotide 

sequence of SEQ ID NOS 1,11,16, 25, 38, 43, 45, 47, 49, 51, and/or 58, sequences which 
share sequence identity thereto, or complements thereof The term **probe" refers to any 
substance that can be used to detect, idmtify, isolate, etc., another substance. A 
polynucleotide probe is comprised of nucldc add can be used to detect, identify, etc., other 

30 nucldc adds, such as DNA and RNA. 



wo 03/085095 PCTAJS03/09921 

-39- 

These polynucleotides can be of any desired size that is efBsctive to achieve the 
specificity desired. For example, a probe can be fix)m about 7 or 8 nucleotides to several 
thousand nucleotides, depending upon its use and purpose. For instance, a probe used as a 
primer PGR can be shorter than a probe used in an ordered array of polynucleotide probes. 

5 Probe sizes vary, and the invention is not limited in any way by flieir size, e.g., probes can be 
from about 7-2000 nucleotides, 7-1000, 8-700, 8-600, 8-500, 8-400, 8-300, 8-150, 8-100, 8- 
75, 7-50, 10-25, 14-16, at least about 8, at least about 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 26, or more, etc. The polynucleotides can have non-naturally-occurring 
nucleotides, e.g., inosine, AZT, 3TC, etc. The polynucleotides can have 100% sequence 

10 idmtity or complementarity to a sequence of SEQ ID NOS 1, 11,16, 25, 38, 43, 45, 47, 49, 
51, and/or 58, or it can have mismatches or nucleotide substitutions, e.g., 1, 2, 3, 4, or 5 
substitutions. The probes can be single-stranded or double-stranded. 

In accordance with the present invention, a polynucleotide can be present in a kit, 
where the kit includes, e.g., one or more polynucleotides, a desired buffer (e.g., phosphate, 

1 5 tris, etc.), detection compositions, RNA or cDNA from different tissues to be used as 

controls, libraries, etc. The polynucleotide can be labeled or imlabeled, with radioactive or 
non-radioactive labels as known in the art. Kits can comprise one or more pairs of 
polynucleotides for amplifying nucleic acids specific for polynucleotides, e.g., comprising a 
forward and reverse primer effective in PGR. These include both sense and anti-sense 

20 orientations. For instance, in PCR-based methods (such as RT-PCR), a pair of primers are 
typically used, one having a sense sequence and the other having an antisense sequence. 

Another aspect of the present invention is a nucleotide sequence that is specific to, or 
for, a selective polynucleotide. The phrases "specific for" or "specific to" a polynucleotide 
have a frmctional meaning that the polynucleotide can be used to identify the presence of one 

25 or more target genes in a sample and distinguish them fix>m non-target genes. It is specific in 
the sense that it can be used to detect polynucleotides above background noise C'non-sfpecific 
binding")- A specific sequence is a defined order of nucleotides (or amino acid sequences, if 
it is a polypeptide sequmce) which occurs in the polynucleotide, e,g., in the nucleotide 
sequences of SEQ ID NOS 1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, and which is 

30 characteristic of that target sequence, and substantially no non-target sequences. A probe or 
mixture of probes can comprise a sequence or sequences that are specific to a plurality of 
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target sequences, e.g., where the sequence is a consensus sequence, a functional domain, etc., 
e.g., capable of recognizing a family of related genes. Such sequences can be used as probes 
in any of the methods described herein or incorporated by reference. Both sense and 
antisense nucleotide sequences are included. A specific polynucleotide according to the 
present invention can be determined routinely. 

A polynucleotide comprising a specific sequence can be used as a hybridization probe 
to identify the presence of, e.g., human or mouse polynucleotide, in a sample comprising a 
mixture of polynucleotides, e.g., on aNorthem blot. Hybridization can be performed under 
high stringent conditions (see, above) to select polynucleotides (and their complements which 
can contain the coding sequence) having at least 90%, 95%, 99%, etc., identity (i.e., 
complementarity) to the probe, but less stringent conditions can also be used. A specific 
polynucleotide sequence can also be fiised in-frame, at either its 5' or 3' end, to various 
nucleotide sequences as mentioned throughout the patent, including coding sequences for 
enzymes, detectable markers, GFP, etc, expression control sequences, etc. 

A polynucleotide probe, especially one that is specific to a polynucleotide of the 
present invention, can be used in gene detection and hybridization methods as akeady 
described. In one embodiment, a specific polynucleotide probe can be used to detect 
wheflier a particular tissue or cell-type is present in a target sample, e.g., with OTB182, 
OTB860, or LAT-1 . To carry out such a method, a selective polynucleotide can be chosen 
which is characteristic of the desired target tissue. Such polynucleotide is preferably chosen 
so that it is expressed or displayed in the target tissue, but not in other tissues which are 
present in the sample. For instance, if detection of brain tissue is desired, it may not matter 
whether the selective polynucleotide is expressed in other tissues, as long as it is not 
expressed in cells normally present in blood, e.g., peripheral blood mononuclear cells. 
Starting from the selective polynucleotide, a specific polynucleotide probe can be designed 
which hybridizes (if hybridization is fiie basis of the assay) under the hybridization conditions 
to the selective polynucleotide, whereby the presence of the selective polynucleotide can be 
determined. 

Probes which are specific for polynucleotides of the present invention can also be 
prepared using involve transcaiption-based systems, e.g., incorporatmg an RNA polymerase 
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promoter into a selective polynucleotide of the preset invention, and Hxea transcribing anti- 
sense RNA using tiie polynucleotide as a template. See, e.g., U.S. Pat. No. 5,545,522. 

Polynucleotide composition 

A polynucleotide according to the present invention can comprise, e.g., DNA, RNA, 
synthetic polynucleotide, peptide polynucleotide, modified nucleotides, dsDNA, ssDNA, 
ssRNA, dsRNA, and mixtures thereof. A polynucleotide can be single- or double-stranded, 
triplex, DNAiRNA, duplexes, comprise hairpins, and other secondary structures, etc. 
Nucleotides comprising a polynucleotide can be joined via various known linkages, e.g., 
ester, sulfemate, sulfamide, phosphorotbioate, phosphoramidate, methylphosphonate, 
carbamate, etc., depending on the deshed purpose, e.g., resistance to nucleases, sudi as 
RNAse H, improved in vivo stabiUty, etc. See, e.g., U.S. Pat No. 5,378,825. Any desired 
nucleotide or nucleotide analog can be incorporated, e.g., 6-mercaptoguanine, 8-oxo-guanine, 
etc. 

Various inodifications can be made to the polynucleotides, such as attachmg 
detectable markers (avidin, biotin, radioactive elements, fluorescent tags and dyes, en^gy 
transfer labels, energy-emitting labels, binding partners, etc.) or moieties which improve 
hybridization, detection, and^or stability. The polynucleotides can also be attached to solid 
supports, e.g., nitrocellulose, magnetic or paramagnetic microspheres (e.g., as described in 
U.S. Pat. No. 5,411,863; U.S. Pat No. 5,543,289; for instance, comprising ferromagnetic, 
supermagnetic, paramagnetic, superparamagnetic, iron oxide and polysaccharide), nylon, 
agarose, diazotized cellulose, latex solid microspheres, polyacrylamides, etc., according to a 
desired method. See, e.g., U.S. Pat Nos. 5,470,967, 5,476,925, and 5,478,893. 

Polynucleotide according to the present invention can be labeled according to any 
desired method. The polynucleotide can be labeled using radioactive tracers such as ^^P, ^^S, 
^H, or to mention some commonly used tracers. The radioactive labeling can be carried 
out according to any method, such as, for example, terminal labeling at the 3' or 5' end using 
a radiolabeled nucleotide, polynucleotide kinase (wifli or without dephosphorylation with a 
phosphatase) or a ligase (depending on tiie end to be labeled). A non-radioactive labeling can 
also be used, combining a polynucleotide of the present invention with residues having 
immunological properties (antigens, haptens), a specific afiBnity for certain reagents 
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(ligands), properties enabling detectable enzyme reactions to be completed (enzymes or 
coenzymes, enzyme substrates, or oflier substances involved in an enzymatic reaction), or 
characteristic physical properties, such as fluorescence or the emission or absorption of light 
at a desired wavelength, etc. 

5 

Nucleic acid detection methods 

Another aspect of the present invention relates to methods and processes for detecting 
polynucleotides. Detection methods have a variety of applications, including for diagnostic, 
prognostic, forensic, and research applications. To accomplish gene detection, a 

10 polynucleotide in accordance with the present invention can be used as a "probe." The temi 
**piobe" or "polynucleotide probe" has its customary meaning in the art, e.g., a polynucleotide 
which is effective to identify (e.g., by hybridization), when used in an appropriate process, 
the presence of a target polynucleotide to whidi it is designed. Identification can involve 
simply determining presence or absence, or it can be quantitative, e.g., in assessing amounts 

15 of a gene or gene transcript present in a sample. Probes can be useful in a variety of ways, 
such as for diagnostic purposes, to identify homologs, and to detect, quantitate, or isolate a 
polynucleotide of the present invention in a test sample. 

Assays can be utilized which permit quantification and/or presence/absence detection 
of a target nucleic acid in a sample. Assays can be performed at the single-cell level, or in a 

20 sample comprising many cells, where the assay is "averagmg" expression over the entire 
collection of cells and tissue present in the sample. Any sxiitable assay format can be used, 
including, but not limited to, e.g.. Southern blot analysis. Northern blot analysis, polymerase 
chain reaction CTCR") (e.g., Saiki et al.. Science, 241:53, 1988; U.S. Pat. Nos. 4,683,195, 
4,683,202, and 6,040,166; PCR Protocols: A Guide to Methods and Applications, Innis et al., 

25 eds.. Academic Press, New York, 1990), reverse transcriptase polymerase chain reaction 

("RT-PCR"), anchored PCR, rapid amplification of cDNA ends C15LACE") (e.g., Schaefer in 
Gene Cloning and Analysis: Current Innovations, Pages 99-1 15, 1997), ligase chain reaction 
("LCR") (EP 320 308), one-sided PCR (Ohara et al., Proc. Natl Acad. ScL, 86:5673-5677, 
1989), indexing methods (e.g., U.S. Pat No. 5,508,169), in situ hybridization, differential 

30 display (e.g., Liang et al., Nucl Acid. Res., 21:3269-3275, 1993; U.S. Pat Nos. 5,262,311, 
5,599,672 and 5,965,409; W097/18454; Prashar and Weissman, Proc. Natl. Acad. Set, 
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93:659-663, and U.S. Pat Nos. 6,010,850 and 5,712,126; Welsh et al.. Nucleic Acid Res., 
20:4965-4970, 1992, and U.S. Pat No. 5,487,985) and other RNA fingerprinting techniques, 
nucleic acid sequence based amplification ('TsTASB A") and other transcription based 
ampUfication systems (e.g., U.S. Pat. Nos. 5,409,818 and 5,554,527; WO 88/10315), 

5 polynucleotide arrays (e.g., U.S. Pat Nos. 5,143,854, 5,424,186; 5,700,637, 5,874,219, and 
6,054,270; PCX WO 92/10092; PCT WO 90/15070), Qbeta Replicase (PCT/US87/00880), 
Strand Displacement AmpUfication ("SDA"), Repair Chain Reaction CRCR'O, nuclease 
protection assays, subtraction-based methods, Rapid-Scan™, etc. Additional usefiil methods 
include, but are not limited to, e.g., template-based amplification methods, competitive PGR 
. 10 (e.g., U.S. Pat No. 5,747,251), redox-based assays (e.g., U.S. Pat No. 5,871,918), Taqman- 
based assays (e.g., Holland et al., Proc, Natl. Acad, Sfci., 88:7276-7280, 1991; U.S. Pat Nos. 
5,210,015 and 5,994,063), real-time fluorescence-based monitoring (e.g., U.S. Pat 
5,928,907), molecular energy transfer labels (e.g., U.S. Pat Nos. 5,348,853, 5,532,129, 
5,565,322, 6,030,787, and 6,1 17,635; Tyagi and Kramer, Nature Biotech., 14:303-309, 

15 1 996). Any method suitable for single cell analysis of gene or protein expression can be 

used, including in situ hybridization, immunocytochemistry, MACS, FACS, flow cytometry, 
etc. For single cell assays, expression products can be measured using antibodies, PGR, or 
other types of nucleic acid amplification (e.g., Brady et al., Methods Mol & Cell BioL 2, 17- 
25, 1990; Eberwine et .al., 1992, Proa. Natl Acad, Set., 89, 3010-3014, 1992; U.S. Pat No. 

20 5,723,290). These and other methods can be carried out conventionally, e.g., as described in 
the mentioned publications. 

Many of such methods may require that the polynucleotide is labeled, or comprises a 
particular nucleotide type usefiil for detection. The present invention includes such modified 
polynucleotides tiiat are necessary to carry out such methods. Thus, polynucleotides can be 

25 DNA, RNA, DNArRNA hybrids, PNA, etc., and can comprise any modification or 
substituent which is effective to achieve detection. 

Detection can be desirable for a variety of differ^it purposes, including research, 
diagnostic, prognostic, and forensic. For diagnostic purposes, it may be desirable to identify 
the presence or quantity of a polynucleotide sequence in a sample, where Ihe sample is 

30 obtained firom tissue, cells, body fluids, etc. In a preferred method as described in more 
detail below, the present invention relates to a method of detecting a polynucleotide 



wo 03/085095 PCTAJS03/0992 1 

-44- 

comprising, contacting a target polynucleotide in a test sample with a polynucleotide probe 
under conditions efiFective to achieve hybridization between the target and probe; and 
detecting hybridization. 

Any test sample in which it is desired to identify a polynucleotide or polypeptide 
5 thereof can be used, including, e.g., blood, urine, saliva, stool (for extracting nucleic acid, 
see, e.g., U.S. Pat No. 6,177,251), swabs comprising tissue, biopsied tissue, tissue sections, 
cultured cells, etc. 

Detection can be accomplished in combination with polynucleotide probes for other 
genes, e.g., genes which are expressed in other disease states, tissues, cells, such as brain, 
1 0 heart, kidney, spleen, thymus, liver, stomach, small intestine, colon, muscle, limg, testis, 
placenta, pituitary, thyroid, skin, adrenal gland, pancreas, salivary gland, uterus, ovary, 
prostate gland, peripheral blood cells (T-cells, lymphocytes, etc.), embryo, breast, fat, adult 
and embryonic stem cells, specific cell-types, such as endothelial, epithelial, myocytes, 
adipose, etc. 

1 5 Polynucleotides can be used in wide range of methods and compositions, including 

for detecting, diagnosing, stagmg, grading, assessing, prognosticating, etc. diseases and 
disorders associated with polynucleotides, for monitoring or assessing therapeutic and/or 
preventative measures, in ordered arrays, etc. Any method of detecting genes and 
polynucleotides can be used; certairdy, the present invention is not to be linaited how such 

20 methods are implemented. 

Along these lines, the present invention relates to methods of detecting 
polynucleotides in a sample comprising nucleic acid. Such methods can comprise one or 
more the following steps in any effective order, e.g., contacting said sample with a 
polynucleotide probe under conditions effective for said probe to hybridize specifically to 

25 nucleic acid in said sample, and detecting the presence or absence of probe hybridized to 

nucleic acid in said sample, wherein said probe is a polynucleotide which is SEQ ID NOS 1, 
11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, a polynucleotide having, e.g., about 70%, 80%, 
85%, 90%, 95%, 99%, or more sequence idmtity thereto, effective or specific fragments 
th^eof, or complements tiiereto. The detection method can be applied to any sample, e.g., 

30 cultured primary, secondary, or established cell lines, tissue biopsy, blood, urine, stool, 
cerebral spinal fluid, and other bodily fluids, for any purpose. 
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Contacting the sample with probe can be carried out by any effective means in any 
effective environment It can be accomplished in a solid, liquid, fix)zen, gaseous, amorphous, 
solidified, coagulated, colloid, etc., mixtures thereof, matrix. For instance, a probe m an 
aqueous medium can be contacted with a sample which is also in an aqueoxis medium, or 

5 which is affixed to a solid matrix, or vice-versa. 

Generally, as used throughout the specification, the term "effective conditions" 
means, e.g., the particular milieu in which the desired effect is achieved Such a milieu, 
includes, e.g., appropriate buffers, oxidizing agents, reducing agents, pH, co-factors, 
temperature, ion concentrations, suitable age and/or stage of cell (such as, in particular part of 

10 the cell cycle, or at a particular stage where particular genes are being expressed) where cells 
are being used, culture conditions (including substrate, oxygen, carbon dioxide, etc.). When 
hybridization is the chosen means of achieving detection, the probe and sample can be 
combined such that the resulting conditions are functional for said probe to hybridize 
specifically to nucleic acid in said sample. 

1 5 The phrase "hybridize specifically" indicates that the hybridization between single- 

stranded polynucleotides is based on nucleotide sequence complementarity. The effective 
conditions are selected such that the probe hybridizes to a preselected and/or definite target 
nucleic add in the sample. For instance, if detection of a polynucleotide set forth in SEQ ID 
NOS 1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58 is desired, a probe can be selected which 

20 can hybridize to such target gene under high stringent conditions, without significant 

hybridization to other genes in the sample. To detect homologs of a polynucleotide set forth 
in SEQ ID NOS 1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, the effective hybridization 
conditions can be less stringent, and/or the probe can comprise codon degeneracy, such that a 
homolog is detected in the sample. 

25 As ahready mentioned, the methods can be carried out by any effective process, e.g., 

by Northern blot analysis, polymerase chain reaction (PGR), reverse transcriptase PGR, 
RAGE PGR, in situ hybridization, etc., as indicated above. When PGR based techniques are 
used, two or more probes are generally used. One probe can be specific for a defined 
sequence which is characteristic of a selective polynucleotide, but the other probe can be 

30 specific for the selective polynucleotide, or specific for a more general sequence, e,g., a 

sequence such as polyA which is characteristic of mRNA, a sequence which is specific for a 
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promotear, ribosome binding site, or oflier transcriptional features, a consensus sequence (e.g,, 
representing a functional domain). For the former aspects, 5* and 3' probes (e.g., polyA, 
Kozak, etc.) are preferred which are capable of specifically hybridizing to the ends of 
transcripts. When PGR is utilized, the probes can also be referred to as ''primers" in that they 
5 can prime a DNA polymerase reaction. 

In addition to testing for the presence or absence of polynucleotides, the present 
invention also relates to determining the amounts at which polynucleotides of the present 
invention are expressed in sample and determining the differential expression of such 
polynucleotides in samples. Such methods can involve substantially the same steps as 
1 0 described above for presence/absence detection, e.g., contacting with probe, hybridizing, and 
detecting hybridized probe, but using more quantitative methods and/or comparisons to 
standards. The amount of hybridization between the probe and target can be determined by 
any suitable methods, e.g., PGR, RT-PCR, RACE PGR, Norfhem blot, polynucleotide 
microairays, Rapid-Scan, etc., and includes both quantitative and qualitative measurements. 

15 

Methods of identifying polymoiphisms, mutations, etc., of polynucleotides 

Polynucleotides of the present invention can also be utilized to identify mutant alleles, 
SNPs, gene rearrangements and modifications, and other polymorphisms of the wild-type 
gene. Mutant alleles, polymorphisms, SNPs, etc., can be identified and isolated from 

20 subjects with diseases that are known, or suspected to have, a genetic component. 

Identification of such genes can be carried out routinely (see, above for more guidance), e.g., 
using PGR, hybridization techniques, direct sequencing, mismatch reactions (see, e.g., 
above), RFLP analj^is, SSCP (e.g., Orita et al., Proc, Natl Acad. ScL, 86:2766, 1992), etc., 
where a polynucleotide having a sequence selected from SEQ ID NOS 1, 1 1, 16, 25, 38, 43, 

25 45, 47, 49, 51, and/or 58 is used as a probe. The selected mutant alleles, SNPs, 

polymorphisms, etc., can be used diagnostically to determine whether a subject has, or is 
susceptible to a disorder associated with polynucleotides, as well as to design therapies and 
predict ttie outcome of the disorder. Methods involve, e.g., diagnosing a disorder associated 
with polynucleotides or determining susceptibilify to a disorder, comprising, detecting the 

30 presence of a mutation in a gene represented by a polynucleotide selected from SEQ ID NOS 
1,11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58. The detecting can be carried out by any 
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eflfective method, e.g., obtaining cells from a subject, detennining the gene sequmce or 
structure of a target gene (using, e.g., mRNA, cDNA, genomic DNA, etc), comparing the 
sequence or structure of the target gene to the structure of the normal gene, whereby a 
difference in sequence or structure indicates a mutation in the gene in the subject. 

5 Polynucleotides can also be used to test for mutations, SNPs, polymorphisms, etc., e.g., using 
mismatch DNA repair technology as described in U.S. Pat. No. 5,683,877; U.S. Pat. No. 
5,656,430; Wu et al., Proc, Natl Acad ScL, 89:8779-8783, 1992. 

The present invention also relates to methods of detecting polymorphisms in genes of 
the present invention, comprising, e,g., comparing the structure of: genomic DNA, mRNA, 

10 cDNA, etc., with the structure of a polynucleotide set forth in SEQ ID NOS 1, 11, 16,25,38, 
43, 45, 47, 49, 51, or 58. The methods can be carried out on a sample from any source, e.g., 
cells, tissues, body fluids, blood, urine, stool, hair, egg, spenn,cerebral spmal fluid, etc. 

These methods can be implemented in many different ways. For example, 
"comparing the structure" steps include, but are not limited to, comparing restriction maps, 

15 nucleotide sequences, amino acid sequences, RFLPs, Dnase sites, DNA mefliylation 
fingerprints (e.g., U.S. Pat. No, 6,214,556), protein cleavage sites, molecular weights, 
electrophoretic mobiUties, charges, ion mobility, etc.. The term "structure" can refer to any 
physical characteristics or configurations which can be used to distinguish between nucleic 
acids and polypeptides. The mediods and instruments used to accomplish the comparing step 

20 depends upon the physical characteristics which are to be compared. Thus, various 

techniques are contemplated, including, e.g., sequencing machines (both amino acid and 
polynucleotide), electrophoresis, mass spectrometer (U.S. Pat. Nos. 6,093,541, 6,002,127), 
Uquid chromatography, HPLC, etc. 

To carry out such methods, "all or part" of the gene or polypeptide can be compared. 

25 For example, if nucleotide sequencing is utilized, the entire gene can be sequenced, including 
promoter, introns, and exons, or only parts of it can be sequenced and compared, e.g., exon 1, 
exon 2, etc. 

Mutagenesis 

30 Mutated polynucleotide sequences of the present invention are useful for various 

purposes, e.g., to create mutations of the polypq)tides they encode, to identify functional 
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regions of genomic DNA, to produce probes for screCTing libraries, etc. Mutagenesis can be 
carried out routinely according to any effective method, e.g., oligonucleotide-directed (Smith, 
M., Ann. Rev, Genet. 19:423-463, 1985), degenerate oligonucleotide-directed (Hill et al.. 
Method Enzymology, 155:558-568, 1987), region-specific (Myers et al.. Science, 229:242- 

5 246, 1985; Derbyshire et al.. Gene, 46:145, 1986; Ner et al., DNA, 7:127, 1988), linker- 
scanning (McKnight and Kingsbury, Science, 217:316-324, 1982), directed using PGR, 
recursive ensemble mutagenesis (Arkin and Yourvan, Proc. Natl Acad, Sci,, 89:781 1-7815, 
1992), random mutagenesis (e.g., U.S. Pat. Nos. 5,096,815; 5,198,346; and 5,223,409), site- 
directed mutagenesis (e.g., WaldOT et al.. Gene, 42:133, 1986; Bauer et al.. Gene, 37:73, 

10 1985; Craik, Bio Techniques, January 1985, 12-19; Smith et al.. Genetic Engineering: 

Principles and Methods, Plenum Press, 1981), phage display (e.g., Lowman et al., Biochem. 
30:10832-10837, 1991; Ladner et al., U.S. Pat No. 5,223,409; Huse, WIPO PubUcation WO 
92/06204), etc. Desired sequences can also be produced by the assembly of target sequences 
using mutually priming oligonucleotides (Uhhnann, Gene, 71:29-40, 1988). For directed 

1 5 mutagenesis methods, analysis of the three-dimensional structure of the polynucleotides 
polypeptide can be used to guide and facilitate making mutants which effect polypeptide 
activity. Sites of substrate-enzyme interaction or other biological activities can also be 
determined by analysis of crystal structure as determined by such techniques as nuclear 
magnetic resonance, crystallography or photoaflSnity labeling. See, for example, de Vos et 

20 al.. Science 255:306-3 12, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodaver et 
al., FEBS Lett. 309:59-64, 1992. 

In addition, libraries of polynucleotides and firagments thereof can be used for 
screening and selection of polynucleotides variants. For instance, a library of coding 
sequences can be generated by treating a double-stranded DNA with a nuclease under 

25 conditions where the nickmg occurs, e.g,, only once per molecule, denaturing the double- 
stranded DNA, renaturing it to for double-stranded DNA that can include sense/antisense 
pairs from different nicked products, removing single-stranded portions firom reformed 
duplexes by treatment with SI nuclease, and Ugatmg the resulting DNAs into an expression 
vector. By this method, expression libraries can be made comprising '*mutagenized" 

30 polynucleotides. The entire coding sequence or parts thereof can be used. 
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Polynucleotide expression, polypeptides produced fliereby, and specific-binding partners 
thereto. 

A polynucleotide according to the present invention can be expressed in a variety of different 
systems, in vitro and in vivo, according to the desired purpose. For example, a 

5 polyuucleotide can be inserted into an expression vector, introduced into a desired host, and 
cultured under conditions effective to achieve expression of a polypeptide coded for by the 
polynucleotide, to search for specific binding partners. Effective conditions include any 
culture conditions which are suitable for achieving production of the polypeptide by the host 
cell, including effective temperatures, pH, medium, additives to the media in which the host 

10 cell is cultured (e.g., additives which amplify or induce expression such as butyrate, or 
methotrexate if the coding polynucleotide is adjacmt to a dhfir gene), cycloheximide, cell 
densities, culture dishes, etc. A polynucleotide can be introduced into the cell by any 
effective method including, e.g., naked DNA, calcium phosphate precipitation, 
electroporation, injection, DEAE-Dextran mediated transfection, fiision with liposomes, 

1 5 association with agents which enhance its uptake into cells, viral transfection. A cell into 
which a polynucleotide of the present invention has been introduced is a transformed host 
cell. The polynucleotide can be extrachromosomal or integrated into a chromosome(s) of the 
host cell. It can be stable or transient An expression vector is selected for its compatibility 
with the host cell. Host cells include, mammalian cells, e.g., COS, CVl, BHK, CHO, HeLa, 

20 LTK, NIH 3T3, cardiac or heart cells, such as Wl (Wang et al.. In vitro Cell Dev., 27:63-74, 
1991), MC29, cardiac fibroblasts (e.g., Wang et al., Tiss Cell., 33:86-96, 2001), cardiac 
microvascular endothelial cells (e.g. JoUow et al., Transplantation, 68:430-439, 1999), T/G 
HA-VSMC (CRL-1999), H9c2(2-1) (CRL-1446), P19 (CRJL-1825) , CNS neural stem cells 
(e.g., U.S. Pat. No. 6,103,530), IMR-32, A172 (ATCC CRL-1620), T98G (ATCC CRL- 

25 1690), CCF-STTGl (ATCC CRL-1718), DBTRG-05MG (ATCC CKL.2020), PFSK-1 

(ATCC CRI^2060), SK-N-AS and other SK cell lines (ATCC CRI^2137), CHP-212 (ATCC 
CRL-2273), RG2 (ATCC CRL-2433), HCN-2 (ATCC CRI^10742), U-87 MG and other U 
MG cell lines (ATCC HTB-14), D283 Med (ATCC HTB-185), PC12, Neuro-2a (ATCC 
CCL-131), muscle ceUs lines, such as RD (CCL-136),.G-7, G-8, C2C12, estabUshed and 

30 primary brain, heart, or muscle cells, G-402 (ATCC CKL-1440), ACHN (ATCC CRI^161 1), 
Vero (ATCC CCL-Sl), 786-0 (ATCC CRL.1932), 769-P (ATCC CRL-1933), CCD 1103 
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KIDTr (ATCC CR1^2304), CCD 1 105 KIDTr (ATCC CRL-2305), Hs 835T (ATCC CRL- 
7569), Hs 926.T (ATCC CRL-7678), Caki-1 (ATCC HTB-46), Caki-2 (ATCC HTB-47), SW 
839 (ATCC HTB-49), LLC-MK2 (ATCC CCL-7), BHK-21 (ATCC CCL-10), MDBK, CV- 
1, (ATCC CRL-1573), KNRK (ATCC CRL-1569), NRK-49F (ATCC CRL-1570), A-704 

5 (ATCC HTB-45), and other established and primary kidney lines, CNS neural stem cells 
(e.g,, U.S. Pat No. 6,103,530), IMR-32, A172 (ATCC CRL-1620), T98G (ATCC CRL- 
1690), CCF-STTGl (ATCC CRL-1718), DBTRG-05MG (ATCC CRL-2020), PFSK-1 
(ATCC CRI^2060), SK-N-AS and other SK cell lines (ATCC CRL-2137), CHP-212 (ATCC 
CRL-2273), RG2 (ATCC CRL-2433), HCN-2 (ATCC CRL-10742), U-87 MG and other U 

10 MG ceU lines (ATCC HTB.14), D283 Med (ATCC HTB-185), PC12, Neuro-2a (ATCC 
CCL-131), and other established and primary brain cell lines. Hep G2 (ATCC NO. HB- 
8065), SK-HEP-1 (ATCC NO HTB.52), H2.35 (ATCC NO CRL-1995), CD-I (ATC NO 
CRL-2254), C3A (ATCC NO CR1^10741), FL83B (ATCC NO CRL.2390), WRL 68 
(ATCC NO CL-48), Hep 3B (ATCC NO HB-8064), insect cells, such as S© (S. frugipeda) 

15 and Drosophila, bacteria, such as E. coli. Streptococcus, bacillus, yeast, such as 

Sacharomyces, S. cerevisiae, fungal cells, plant cells, embryonic or adult stem cells (e.g., 
mammalian, such as mouse or human). 

Expression control sequences are similarly selected for host compatibility and a 
desired purpose, e.g., high copy number, high amounts, induction, ampUfication, controlled 

20 expression. Other sequences which can be employed include enhancers such as from SV40, 
CMV, RSV, inducible promoters, cell-type specific elements, or sequences which allow 
selective or specific cell expression. Promoters that can be used to drive its expression, 
include, e.g., the endogenous promoter, MMTV, SV40, trp, lac, tac, or T7 promoters for 
bacterial hosts; or alpha factor, alcohol oxidase, or PGH promoters for yeast. RNA 

25 promoters can be used to produced KNA transcripts, such as T7 or SP6. See, e.g., Melton et 
al.. Polynucleotide Res., 12(18):7035-7056, 1984; Dunn and Studier. J. Mol Bip., 166:477- 
435, 1984; U.S. Pat. No. 5,891,636; Studier et al.. Gene Expression Technology, Methods in 
Enzymology, 85:60-89, 1987. In addition, as discussed above, translational signals (including 
in-fiame insertions) can be included. 

30 When a polynucleotide is expressed as a heterologous gene in a transfected cell line, 

the gene is introduced into a cell as described above, under effective conditions in which the 
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gene is expressed. The term *Tieterologous" means that the gene has been introduced into flie 
cell line by the '*hand-of-man." Introduction of a gene into a cell line is discussed above. 
The transfected (or transformed) cell expressing the gene can be lysed or the cell line can be 
used intact 

5 For expression and other purposes, a polynucleotide can contain codons found in a 

naturally-occurring gene, transcript, or cDNA, for example, e.g., as set forth in SEQ ID NOS 
1,11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, or it can contain diegenerate codons coding for 
the same anuno acid sequences. For instance, it may be desirable to change the codons in the 
sequence to optimize the sequence for expression in a desired host See, e.g., U.S. Pat. Nos. 

10 5,567,600 and 5,567,862. 

A polypeptide according to the present invention can be recovered from natural 
sources, transformed host cells (culture medium or cells) according to tiie usual methods. 
Another approach is express the polypeptide recombinantly with an aflSnity tag (Flag epitope, 
HA epitope, myc epitope, 6xHis, maltose binding protein, chitinase, etc) and then purify by 

1 5 anti-tag antibody-conjugated aflBnity chromatography. 

The present invention also relates to polypeptides, e.g., an isolated human polypeptide 
comprising or having the amino acid sequence set forth in SEQ ID NOS 1, 11, 16, 25, 38, 43, 
45, 47, 49, 51, and/or 58, an isolated human polypeptide comprising an amino acid sequence 
havmg 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more amino acid 

20 sequence identity to the amino acid sequence set forth in SEQ ID NOS 1,11, 16, 25, 38, 43, 
45^ 47^ 49, 51, and/or 58, and optionally having one or more activities. Fragments can also 
used, e.g., to produce antibodies or other iomiune responses, as competitors to any activity. 
These fragments can be referred to as being "specific for"' polynucleotides. The latter phrase, 
as already defined, indicates that the peptides are characteristic of polynucleotides, and that 

25 the defined sequences are substantially absent from all other protdn types. Such 

polypeptides can be of any size which is necessary to confer specificity, e.g., 5, 8, 10, 12, 15, 
20, etc. 

The present invention also relates to specific-binding partners. These include 
antibodies which are specific for polypeptides encoded by polynucleotides of the present 
30 iQvention, as well as other binding-partners which interact with polynucleotides and 

polypeptides of the present invention. Protein-protein interactions between polynucleotides 
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and other polypeptides and binding partners can be identified using any suitable mefliods, 
e.g., protein binding assays (e.g., filtration assays, chromatography, etc.) , yeast two-hybrid 
system (Fields and Song, Nature, 340: 245-247, 1989), protein arrays, gel-shifl assays, FRET 
(fluorescence resonance energy transfer) assays, etc. Nucleic acid interactions (e.g., protein- 

5 DNA or protein-RNA) can be assessed using gel-shift assays, e.g., as carried out in U.S. Pat. 
No. 6,333,407 and 5,789,538. 

Antibodies, e.g., polyclonal, monoclonal, recombinant, chimeric, humanized, single- 
chain, Fab, and Augments thereof, can be prepared according to any desired method. See, 
also, screening recombinant immunoglobulin libraries (e.g., Orlandi et al., Proc, Natl Acad. 

10 ScL, 86:3833-3837, 1989; Huse et al., Science, 256: 1275-1281, 1989); in vitro stimulation of 
lymphocyte populations; Winter and MUstem, Nature, 349: 293-299, 1991. The antibodies 
can be IgM, IgG, subtypes, IgG2a, IgGl, etc. Antibodies, and immune responses, can also be 
generated by administering naked DNA See, e.g., U.S. Pat Nos. 5,703,055; 5,589,466; 
5,580,859. Antibodies can be used fix)m any source, including, goat, rabbit, mouse, chicken 

1 5 (e.g., IgY; see, Duan, WO/029444 for methods of making antibodies in avian hosts, and 

harvesting the antibodies firom the eggs). An antibody specific for a polypeptide means that 
the antibody recognizes a defined sequence of amino acids within or including the 
polypeptide. Other specific binding partners include, e.g., aptamers and PNA. Antibodies 
can be prepared against specific epitopes or domains of polynucleotides. 

20 Any form or type of antibody can be prepared and used. For example, antibodies can 

be humanized, e.g., where they are to be used therapeutically. Another form of an antibody 
firagment is a peptide coding for a single complementarity-determining region (CDR). CDR 
pq)tides ("minimal recognition units") can be obtained by constructing genes encoding the 
CDR of an antibody of interest. The term "antibody** as used herein includes intact 

25 molecules as well as fi-agments thereof, such as Fab, F(ab02, and Fv which are capable of 

binding to an epitopic determinant Such antibody firagments retain some ability to selectively 
bmd with its antigen or receptor. The tran "epitope" refers to an antigenic detemiinant on an 
antigen to which the paratope of an antibody binds. Epitopic determinants usually consist of 
chemically active surface groupings of molecules such as anouno adds or sugar side chains 

30 and usually have specific three dimensional structural characteristics, as well as specific 
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charge characteristics. Antibodies can be prepared against specific epitopes or polypeptide 
domains. 

Antibodies which bind to polynucleotides polypeptide of ttie present invention can 
be prepared using an intact polypeptide or Augments containing small peptides of mterest as 
the immunizing antigen. For example, it maybe desirable to produce antibodies that 
specifically bind to the N- or C-terminal domains of polynucleotides. The polypeptide or 
peptide used to immunize an animal which is derived from translated cDNA or chemically 
syntiiesized which can be conjugated to a carrier protein, if desired. Anti-idiotype technology 
can also be used to produce invention monoclonal antibodies which nmnic an epitope. 

Methods of detecting polypeptides 

Polypeptides coded for by polynucleotides of flie present invention can be detected, 
visualized, determined, quantitated, etc. according to any effective method, usefiil metiiods 
include, e.g., but are not limited to, immunoassays, RIA (radioimmunassay), ELISA, 
(enzyme-linked-immunosorbent assay), hnmunoflourescence, flow cytometry, histology, 
electron microscopy, light microscopy, in situ assays, immunopredpitation. Western blot 

hnmunoassays may be carried in liquid or on biological support For instance, a 
sample (e.g., blood, stool, urine, ceUs, tissue,cerebral spinal fluid, body fluids, etc.) can be 
brought in contact with and immobilized onto a solid phase support or carrier such as 
nitrocellulose, or other solid support that is capable of immobilizing cells, cell particles or 
soluble proteins. The support may then be washed with suitable buffers followed by 
treatment with the detectably labeled polynucleotides specific antibody. The solid phase 
support can then be washed with a buffer a second time to remove unbound antibody. The 
amount of bound label on solid support may tiien be detected by conventional means. 

A "solid phase support or carrier" includes any support capable of binding an antigen, 
antibody, or other specific binding partner. Supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylaimdes, and magnetite. A support material can have any structural or physical 
configuration. Thus, the support configuration may be spherical, as in a bead, or cylindrical, 
as in the inside surface of a test tube, or the external surface of a rod. Alternatively, the 
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surface may be flat such as a sheet, test strip, etc. Preferred supports include polystyrene 
beads 

One of the many ways in which gene peptide-specific antibody can be detectably 
labeled is by linking it to an enzyme and using it in an enzyme immunoassay (EIA). See, 

5 e.g., Voller, A., *The Enzyme Linked hmnxmosorbent Assay (ELISA)," 1978, Diagnostic 
Horizons 2, 1-7, Microbiological Associates Quarterly Publication, Walkersville, Md.); 
VoUer, A. et al., 1978, J. Qin. Pathol. 31, 507-520; Butler, J, E., 1981, Meth. Enzymol. 73, 
482-523; Maggio, E. (ed.), 1980, Enzyme hmnunoassay, CRC Press, Boca Raton, Fla.. The 
enzyme which is bound to the antibody will react with an appropriate substrate, preferably a 

10 chromog^c substrate, in such a manner as to produce a chemical moiety that can be 

detected, for example, by spectrophotometric, fluorimetric or by visual means. Enzymes that 
can be used to detectably label the antibody include, but are not limited to, malate 
dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol 
dehydrogenase, .alpha.-glycerophosphate, dehydrogenase, triose phosphate isomerase, 

15 horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, .beta.- 
galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, 
glucoamylase and acetylcholinesterase. The detection can be accomplished by colorimetric 
methods that employ a chromogenic substrate for the enzyme. Detection may also be 
accomplished by visual comparison of the extent of enzymatic reaction of a substrate in 

20 comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other immunoassays. 
For example, by radioactively labeling the antibodies or antibody fragments, it is possible to 
detect polynucleotides peptides through the use of a radioimmunoassay (RIA). See, e.g., 
Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on RadioUgand 

25 Assay Techniques, The Endocrine Society, March, 1986. The radioactive isotope can be 
detected by such means as the use of a gamma counter or a scintillation counter or by 
autoradiography. 

It is also possible to label the antibody with a fluorescent compomd. When the 
fluorescently labeled antibody is exposed to light of the proper wave length, its presence can 
30 then be detected due to fluorescence. Among the most commonly used fluorescent labeling 
compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin. 



wo 03/085095 PCTAJS03/09921 

-55- 

allophycocyanin, o-phfhaldehyde and fluorescamine. The antibody can also be detectably 
labeled using fluorescence emitting metals such as fliose in the lanthanide series. These 
metals can be attached to the antibody using such metal chelating groups as 
diethylenetriaminepentacetic acid (DTP A) or ethylenediaminetetraacetic acid (EDTA). 

5 The antibody also can be detectably labeled by coupling it to a chemiluminescent 

compound. The presence of the chemiluminescent-tagged antibody is then deteraiined by 
detecting the presence of luminescence that arises during the course of a chemical reaction. 
Examples of useful chemiluminescent labeling compounds are luminol, isoluminol, 
theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 

1 0 Likewise, a bioluminescent compound may be used to label the antibody of the 

present invention. Bioluminescence is a type of chemiluminescence found in biological 
systems in which a catalytic protein increases the efficiency of the chemilu m inescent 
reaction. The presence of a bioluminescent protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for purposes of labeling are luciferin, 

1 5 ludferase and aequoiin. 



Diagnostic 

The present invention also relates to methods and compositions for diagnosing a 
disorder, or determining susceptibility to a disorder, using polynucleotides, polypeptides, and 

20 specific-binding partners of the present invention to detect, assess, determine, etc., 

polynucleotides of the present invention. In such methods, the gene can serve as a marker 
for the disorder, e.g., where the gene, when mutant, is a direct cause of the disorder; where 
the gene is affected by another gene(s) which is directly responsible for the disorder, e.g., 
when the gene is part of the same signaling pathway as the directly responsible gene; and, 

25 where the gene is chromosomally linked to the gene(s) directly responsible for the disorder, 
and segregates with it Many other situations are possible. To detect, assess, detOTnine, etc., 
a probe specific for the gene can be employed as described above and below. Any method of 
detecting and/or assessing the gene can be used, including detecting expression of the gene 
using polynucleotides, antibodies, or other specific-binding partners. 

30 The present invention relates to methods of diagnosing a disorder associated wifli a 

polynucleotide of the present invention, or determining a subject's susceptibiUty to such 
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disorder, comprising, e.g., assessing the expression of polynucleotide of the present invention 
in a tissue sample comprising tissue or cells suspected of having the disorder. The phrase 
"diagnosing" indicates that it is determined whether the sample has the disorder. A 
"disorder"' means, e.g., any abnormal condition as in a disease or malady. ^Determining a 

5 subject's susceptibility to a disease or disorder" indicates that the subject is assessed for 
whethCT s/he is predisposed to get such a disease or disorder, where the predisposition is 
indicated by abnormal expression of the gene (e.g., gene mutation, gene expression pattern is 
not nomial, etc.). Predisposition or susceptibility to a disease may result when a such disease 
is influenced by epigenetic, environmental, etc., fectors. Diagnosing includes prenatal 

10 screening where samples from the fetus or embryo (e.g., via amniocentesis or CV sampUng) 
are analyzed for the expression of the gene. 

By the phrase "assessing expression of a gene or polynucleotide of the present 
invention," it is meant that the functional status of the gene is evaluated. This includes, but 
is not limited to, measuring expression levels of said gene, determining the genomic structure 

15 of said gene, determining the mRNA structure of transcripts from said gene, or measuring the 
expression levels of polypeptide coded for by said gene. Thus, the tenn "assessing 
expression" includes evaluating the all aspects of the transoiptional and translational 
machinery of the gene. For instance, if a promoter defect causes, or is suspected of causing, 
the disorder, then a sample can be evaluated (i.e., "assessed") by looking (e.g., sequencing or 

20 restriction mapping) at the promoter sequence in the gene, by detecting transcription products 
(e.g., RNA), by detecting translation product (e.g., polypeptide). Any measure of whether the 
gene is functional can be used, including, polypeptide, polynucleotide, and functional assays 
for the gene's biological activity. 

In making the assessment, it can be useful to compare the results to a normal gene, 

25 e.g., a gene which is not associated with the disorder. The nature of the comparison can be 
determined routinely, depending upon how the assessing is accomplished. If, for example, 
the mRNA levels of a sample is detected, then the mKNA levels of a normal can serve as a 
comparison, or a gene which is known not to be affected by the disorder. Methods of 
detecting mRNA are well known, and discussed above, e.g., but not limited to, Northem blot 

30 analysis, polymerase chain reaction (PGR), reverse transcriptase PGR, RACE PGR, etc. 
Similarly, if polypeptide production is used to evaluate the gene, then the polyp^tide in a 
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nonnal tissue sample can be used as a comparison, or, polypeptide from a different gene 
whose expression is known not to be affected by the disorder. These are only examples of 
how such a method could be carried out 

The present invention relates to methods of identifying a genetic basis for a disease or 
disease-susceptibility, comprising, e.g., determining the association of a disease or disease- 
susceptibility with a gene of the present invention. An association between a disease or 
disease-susceptibility and nucleotide sequence includes, e.g., establishing (or finding) a 
correlation (or relationship) between a DNA marker (e.g., gene, VNTR, polymorphism, EST, 
etc.) and a particular disease state. Once a relationship is identified, the DNA marker can be 
utilized in diagnostic tests and as a drug target. Any region of the gene can be used as a 
source of the DNA marker, exons, introns, intergenic regions, etc. 

Human linkage maps can be constructed to establish a relationship between a gene 
and a disease or condition. Typically, polymorphic molecular markers (e.g., STRP's, SNP's, 
RFLP's, VNTR's) are identified within the region, linkage and map distance between the 
maikers is then established, and then linkage is established between phenotype and the 
various individual molecular markers. Maps can be produced for an individual family, 
selected populations, patient popxilations, etc. In general, these methods involve identifying a 
marker associated with the disease (e.g., identifying a polymorphism in a family which is 
linked to the disease) and then analyzing the surrounding DNA to identity the gene 
responsible for the phenotype. See, e.g., Kruglyak et al., Am. J. Hum. Genet, 58, 1347-1363, 
1996;MatiseetaL,iVa^. Genet., 6(4):384-90, 1994. 

Assessing the effects of therapeutic and preventative interventions (e.g., 
administration of a dmg, chemotherapy, radiation, etc.) is a major effort m drag discovery, 
clinical medicine, and pharmacogenomics. The evaluation of therapeutic and preventative 
measures, whether experimental or akeady in clinical use, has broad applicabiUty, e.g., in 
clinical trials, for monitoring the status of a patient, for analyzing and assessing animal 
models, and in any scenario involving disease treatment and prevention. Analyzing the 
expression profiles of polynucleotides of the present mvention can be utilized as a parameter 
by which mterventions are judged and measured. Treatment of a disorder can change the 
expression profile in some manner which is prognostic or indicative of the drag's effect on it. 
Changes in the profile can indicate, e.g., drag toxicity, return to a normal level, etc. 
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Accordingly, the preset inv^tion also relates to methods of monitoring or assessing a 
therapeutic or prevmtative measure (e.g., chraiofherapy, radiation, anti-neoplastic drugs, 
antibodies, etc.) in a subject having a disorder, or, susceptible to such a disorder, comprising, 
e.g., detecting the expression levels of polynucleotides. A subject can be a cell-based assay 
5 system, non-human animal model, human patient, etc. Detecting can be accomplished as 
described for the methods above and below. By *therapeutic or preventative intervention," it 
is meant, e.g., a drug administered to a patient, surgery, radiation, chemotherapy, and other 
measures taken to prevent, treat, or diagnose a disorder. Expression can be assessed in any 
sample comprising any tissue or cell type, body fluid, etc., as discussed for other methods of 

10 the present invention. 

The present invention also relates to methods of using polynucleotides binding 
partners, such as antibodies, to deliver active agents to a tissue for a variety of different 
purposes, including, e.g., for diagnostic, therapeutic, and research purposes. Methods can 
involve delivering or administering an active agent to the target tissue, comprising, e.g., 

1 5 administering to a subject in need thereof, an effective amount of an active agent coupled to a 
biuding partner specific for a polypeptide of the present invention, wherein said binding 
partner is effective to deliver said active agent specifically to the taret tissue. 

Any type of active agent can be used in combination with polynucleotides, including, 
therapeutic, cytotoxic, cytostatic, chemotherapeutic, anti-neoplastic, anti-proliferative, anti- 

20 biotic, etc., agents. A chemotherapeutic agent can be, e.g., DNA-interactive agent, alkylating 
agent, antimetabolite, tubulin-interactive agent, hormonal agent, hydroxyurea, Cisplatin, 
Cyclophosphamide, Altretamine, Bleomycin, Dactinomycin, Doxorubicin, Etoposide, 
Teniposide, pacUtaxel, Cytoxan, 2-methoxycarbonylaminobenzimidazole, Plicamycin, 
Methotrexate, Fluorouracil, Fluorodeoxyuridin, CB3717, Azacitidine, Floxuridine, 

25 Mercapyopurine, 6-Thioguanine, Pentostatin, Cytarabine, Fludarabine, etc. Agents can also 
be contrast agents usefiil in imaging technology, e.g.. X-ray, CT, CAT, MRI, ultrasound, 
PET, SPECT, and sdntographic. 

An active agent can be associated in any manner with a polynucleotides binding 
partner which is effective to achieve its delivery specifically to the target. Specific delivery 

30 or targeting indicates that the agent is provided to the target, without being substantially 
provided to other tissues. This is useful especially where an agent is toxic, and specific 
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targeting enables the majority of flie toxicity to be aimed at the target tissue, wifii as small as 
possible effect on other tissues in the body. The association of the active agent and the 
binding partner ("coupling") can be direct, e.g., through chemical bonds between flxe binding 
partner and the agent, or, via a linking agent, or the association can be less direct, e.g., where 
5 the active agent is in a liposome, or other carrier, and the binding partner is associated with 
the liposome surface. In such case, the binding partner can be oriented in such a way that it is 
able to bind to polypolypeptide on the cell surface. Methods for delivery of DNA via a cell- 
surface receptor is described, e.g., in U.S. Pat. No. 6,339,139. 

1 0 Identifying agent methods 

The present invention also relates to methods of identifying agents, and the agents 
themselves, which modulate polynucleotides and polypeptides of the present invention. 
These agents can be used to modulate the biological activity of the polypeptide encoded for 
by the gene, or the gene, itself. Agents which regulate the gene or its product are useful in 

15 variety of different environments, including as medicinal agents to treat or prevent disorders 
associated with polynucleotides of the present invention and as research reagents to modify 
the function of tissues and cell. 

Methods of identifying agents generally comprise steps in which an agent is placed in 
contact with the gene, its transcription product, its translation product, or other target, and 

20 then a determination is performed to assess whether the agent "modulates" the target. The 
specific method utilized will depend upon a number of factors, including, e.g., the target (i.e., 
is it the gene or polypeptide encoded by it), the environment (e.g., in vitro or in vivo), the 
composition of the agent, etc. 

For modulating the expression of a gene or polynucleotide, a method can comprise, in 

25 any effective order, one or more of the following steps, e.g., contacting a polynucleotide or 
gene (e.g., in a cell population) with a test agent under conditions effective for said test agent 
to modulate its expression, and determining whether said test agent modulates it An agent 
can modulate expression of polynucleotides at any level, including transcription (e.g., by 
modulating fibe promoter), translation, and/or perdurance of the nucleic acid (e.g,, 

30 degradation, stability, etc.) in the cell. 
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For modulating the biological activity of a polypeptide, a method can comprise^ in 
any effective order, one or more of the following steps, e.g., contacting a polypq>tide (e.g., in 
a cell, lysate, or isolated) with a test agent imder conditions effective for said test agent to 
modulate the biological activity of said polypeptide, and determining whether said test agent 
5 modulates said biological activity. 

Contactmg polynucleotides with the test agent can be accomplished by any suitable 
method and/or means that places the agent in a position to functionally control expression or 
biological activity. Functional control indicates that the agent can exert its physiological 
eflFect on polynucleotides through whatever mechanism it works. The choice of the method 

10 and/or means can depend upon the nature of the agent and the condition and type of 
environment in which the polynucleotides is presented, e.g., lysate, isolated, or in a cell 
population (such as, in vivo, in vitro, organ explants, etc.). For instance, if the cell population 
is an in vitro cell culture, the agent can be contacted with the cells by adding it directly into 
the culture medium. If the agait cannot dissolve readily in an aqueous medium, it can be 

15 incorporated into liposomes, or another lipophilic carrier, and then administered to the cell 
culture. Contact can also be faciUtated by incorporation of agent with carriers and delivery 
molecules and complexes, by injection, by infusion, etc. 

Agents can be directed to, or targeted to, any part of the polypeptide which is 
effective for modulating it. For example, agents, such as antibodies and small molecules, can 

20 be targeted to cell-surface, exposed, extracellular, ligand binding, functional, etc., domains of 
the polypeptide. Agents can also be directed to intracellular regions and domains, e.g., 
regions where the polypeptide couples or interacts with intracellular or intramembrane 
binding partners. 

After the agent has been administered in such a way that it can gain access, it can be 
25 determined whether the test agent modulates expression or biological activity. Modulation 
can be of any type, quality, or quantity, e.g,, increase, facilitate, enhance, up-regulate, 
stimulate, activate, amplify, augment, induce, decrease, down-regulate, diminish, lessen, 
reduce, etc. The modulatory quantity can also encompass any value, e.g., 1%, 5%, 10%, 
50%, 75%, 1-fold, 2-fold, 5-fold, 10-fold, 100-fold, etc. To modulate expression means, e.g., 
30 that the test agent has an effect on its expression, e.g., to effect the amount of transcription, to 
effect RNA splicing, to effect translation of the RNA into polypeptide, to effect RNA or 
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polypeptide stability, to effect polyadenyiation or other processing of the RNA, to effect post- 
transcriptional or post-translational processing, etc. To modulate biological activity means, 
e.g., that a functional activity of the polypeptide is changed in comparison to its nomial 
activity in the absence of the agent This effect includes, increase, decrease, block, inhibit, 
5 enhance, etc. 

A test agent can be of any molecular composition, e.g., chemical compoimds, 
biomolecules, such as polypeptides, lipids, nucleic acids (e.g., antisense to a polynucleotide 
sequence selected from SEQ ID NOS 1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58), 
carbohydrates, antibodies, ribozymes, double-stranded RNA, aptamers, etc. For example, if a 

10 polypeptide to be modulated is a cell-surface molecule, a test agent can be an antibody that 
specifically recognizes it and, e.g., causes the polypeptide to be internalized, leading to its 
down regulation on the surface of the cell. Such an effect does not have to be permanent, 
but can require the presence of the antibody to continue tiie down-regulatory effect. 
Antibodies can also be used to modulate the biological activity of a polypeptide in a lysate or 

1 5 other cell-free form. 

Additional cell-based test sj^tems suitable for the analysis of GPCR polypeptides are 
summarized in Marchese et al. (1999, Trends in Pharmacol. Sd. 20: 370-375) and comprise 
so-called "ligand screening assays." For example in yeast cells the pheromon receptor can be 
replaced by a GPCR according to the invention. The effect of test substances on the receptor 

20 can be determined upon modulation of histidine synthesis, i.e. by growing in histidine-free 
medium. In addition using cells transfected with nucleic acids according to the invention it 
can be analyzed whether test substances mediate translocation of a detectable arrestins, for 
example of a arrestin-GFP-ftision protein. MoreovCT, it can be analyzed whether test 
substances mediate GPCR-mediated dispersion or aggregation of Xenopus laevis 

25 melanophores. Another test system utilizes the universal adapter G-protein G alphal6, which 
mobilizes Ca.sup.2+. Other screening test systems are described in Lemer et al., supra; 
W096/41 169; U.S. Pat No. 5,482,835; WO99/06535; EP 0 939 902; W099/66326; 
W098/34948; EP 0 863 214; U.S. Pat No. 5,882,944 and U.S. Pat No. 5,891,641. 

30 Therapeutics 

Selective polynucleotides, polypeptides, and specific-binding partners thereto, can be 
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utilized in fherapeutic applications. Use&l methods include, but are not limited to, 
immunotherapy (e.g., using specific-binding partners to polypeptides), vaccination (e.g., 
using a selective polypeptide or a naked DNA encoding such polypq)tide), protein or 
polypeptide replacement therapy, gene therapy (e.g., germ-line correction, antisCTse), etc. 
5 Various immunotherapeutic approaches can be used. For instance, imlabeled 

antibody that specifically recognizes a tissue-specific antigen can be used to stimulate the 
body to destroy or attack a cancer or other diseased tissue, to cause down-regulation, to 
produce complement-mediated lysis, to inhibit cell growth, etc., of target cells which display 
the antigen, e.g., analogously to how c-erbB-2 antibodies are used to treat breast cancer. In 

10 addition, antibody can be labeled or conjugated to enhance its deleterious effect, e.g., with 
radionuclides and other energy emitting eatitities, toxins, such as ricin, exotoxin A (ETA), 
and diphtiieria, cytotoxic or cytostatic agents, immunomodulators, chemotherapeutic agents, 
etc. See, e.g., U.S. Pat No. 6,107,090. 

An antibody or other specific-binding partner can be conjugated to a second molecule, 

1 5 such as a cytotoxic agent, and used for targeting the second molecule to a tissue-antigen 

positive cell (Vitetta, E. S. et al., 1993, Immunotoxin therapy, in DeVita, Jr., V, T. et al., eds. 
Cancer: Principles and Practice of Oncology, 4th ed., J. B. lippincott Co., Philadelphia, 
2624-2636). Examples of cytotoxic agents include, but are not limited to, antimetabolites, 
alkylating agents, anthracyclines, antibiotics, anti-mitotic agents, radioisotopes and 

20 chemotherapeutic agents. Further examples of cytotoxic agents include, but are not limited to 
ricin, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, tenoposide, 
vincristiQe, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, 1- 
dehydrotestosterone, diptheria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, elongation 
factor-2 and glucocorticoid. Techniques for conjugating therapeutic agents to antibodies are 

25 well. 

hi addition to inmiunotfaerapy, polynucleotides and polypeptides can be used as 
targets for non-immunotherapeutic applications, e.g., using compounds which interfere with 
function, expression (e.g., antisense as a th^peutic agent), assembly, etc. RNA interference 
can be used in vitro and in vivo to silence polynucleotides when its expression contributes to 
30 a disease (but also for otiier purposes, e.g., to identify the gene's Amotion to change a 
developmental pathway of a cell, etc.). See, e.g.. Sharp and Zamore, Science^ 287:2431- 
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2433, 2001 ; Grishok et al.. Science, 287:2494, 2001. 

Delivery of therapeutic agents can be achieved according to any effective method, 
including, liposomes, viruses, plasmid vectors, bacterial delivery systems, orally, 
systOTiically, etc. Therapeutic agents of the present invention can be administered in any 
5 form by any effective route, including, e.g., oral, parenteral, enteral, intraperitoneal, topical, 
transdermal (e.g., using any standard patch), intravenously, ophthalmic, nasally, local, non- 
oral, such as aerosal, inhalation, subcutaneous, intramuscular, buccal, sublingual, rectal, 
vaginal, intra-arterial, and intrathecal, etc. They can be administered alone, or in 
combination with any ingredient(s), active or inactive. 

10 In addition to tiierapeutics, per se, the present invention also relates to methods of 

treating a disease showing altered expression of a polynucleotide or polypeptide of the 
present invention, comprising, e.g., administering to a subject in need thereof a therapeutic 
agent which is effective for regulating expression of said polynucleotide or polypeptide 
which is effective in treating said disease. The term ^treating*' is used conventionally, e.g., 

15 the management or care of a subject for the purpose of combating, alleviating, reducing, 
relieving, improving the condition of, etc., of a disease or disorder. By the phrase "altered 
expression," it is meant that the disease is associated with a mutation in the gene, or any 
modification to the gene (or corresponding product) which affects its normal function. Thus, 
expression of polynucleotides refers to, e.g., transcription, translation, splicing, stability of 

20 the mRNA or protein product, activity of the gene product, differential expression, etc. 

Any agent which ^treats" the disease can be used. Such an agent can be one which 
regulates the expression of the polynucleotides. Expression refers to the same acts already 
mentioned, e.g. transcription, translation, splicing, stability of the mRNA or protein product^ 
activity of the gene product, differential expression, etc. For instance, if the condition was a 

25 result of a complete deficiency of the gene product, administration of gme product to a 
patient would be said to treat the disease and regulate the gene's expression. Many other 
possible situations are possible, e.g., where the gene is aberrantly expressed, and the 
tiherapeutic agent regulates the aberrant expression by restoring its normal expression pattern. 



30 Antisense 
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Antisense polynucleotide (e.g., RNA) can also be prepared fix)m a polynucleotide 
according to the present invention, preferably an anti-sense to a sequence of SEQ ID NOS 1, 
11, 16, 25, 38, 43,45, 47, 49, 51, and/or 58. Antisense polynucleotide can be used in various 
ways, such as to regulate or modulate expr^sion of the polypeptides they encode, e.g., inhibit 
5 their expression, for in situ hybridization, for therapeutic purposes, for making targeted 

mutations (in vivo, triplex, etc.) etc. For guidance on administering and designing anti-sense, 
see, e.g., U.S. Pat. Nos. 6,200,960, 6,200,807, 6,197,584, 6,190,869, 6,190,661, 6,187,587, ' 
6,168,950, 6,153,595, 6,150,162, 6,133,246, 6,117,847, 6,096,722, 6,087,343, 6,040,296, 
6,005,095, 5,998,383, 5,994,230, 5,891,725, 5,885,970, and 5,840,708. An antisense 

10 polynucleotides can be operably linked to an expression control sequence. A total length of 
about 35 bp can be used in cell culture with cationic liposomes to facilitate cellular uptake, 
but for in vivo use, preferably shorter oligonucleotides are administered, e.g. 25 nucleotides. 

Antisense polynucleotides can comprise modified, nonnaturally-occurring nucleotides 
and linkages between the nucleotides (e.g., modification of the phosphate-sugar backbone; 

1 5 methyl phosphonate, phosphorothioate, or phosphorodithioate linkages; and 2'-0-methyl 

ribose sugar units), e.g., to enhance in vivo or in vitro stability, to confer nuclease resistance, 
to modulate uptake, to modulate cellular distribution and compartmentalization, etc. Any 
effective nucleotide or modification can be used, including those already mentioned, as 
known in the art, etc., e.g., disclosed in U.S. Pat. Nos. 6,133,438; 6,127,533; 6,124,445; 

20 6,121,437; 5,218,103 (e.g., nucleoside tiiiophosphoramidites); 4,973,679; Sproat et al., "2*-0- 
Methyloligoribonucleotides: synthesis and applications,'' Oligonucleotides and Analogs A 
Practical Approach, Eckstein (ed.), IRL Press, Oxford, 1991, 49-86; Iribarren et al., "2'0- 
Alkyl Oligoribonucleotides as Antisense Probes," Proc. Natl. Acad. Sci. USA, 1990, 87, 
7747-7751; Cotton et al., "2 -O-methyl, 2-O-ethyl oligoribonucleotides and phosphorothioate 

25 oligodeoxyribonucleotides as inhibitors of the in vitro U7 snRNP-dependent mRNA 
processing event,** Nucl. Acids Res., 1991, 19, 2629-2635. 



Arrays 

Hie present invention also relates to an ordered array of polynucleotide probes and 
30 specific-binding partners (e.g., antibodies) for detecting the expression of polynucleotides or 
polypeptides in a sample, comprising, e.g., one or more polynucleotide probes or specific 
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binding partners associated with a solid siq>port or in separate receptacles, wherein each 
probe is specific for polynucleotides, and tiie probes comprise a nucleotide sequence of SEQ 
IDNOS 1, 11, 16, 25, 38, 43,45, 47,49, 51, and/or 58 which is specific for said gme, a 
nucleotide sequence having sequence identity to SEQ ID NOS 1, 1 1, 16, 25, 38, 43, 45, 47, 
5 49, 51 , and/or 58 which is specific for said gene or polynucleotide, or complements thereto, 
or a specific-binding partner which is specific for polynucleotides. 

The phrase "ordered array*' indicates that the probes (polynucleotides, binding- 
partn^, polypeptides, etc.) are arranged in an identifiable or position-addressable pattern, 
e.g., such as the arrays disclosed in U.S. Pat. Nos. 6,156,501, 6,077,673, 6,054,270, 

10 5,723,320, 5,700,637, WO099197 11, WO00023803. The probes are associated with the 
solid support in any effective way. For instance, the probes can be bound to tiie solid 
support, either by polymerizing the probes on the substrate, or by attaching a probe to the 
substrate. Association can be, covalent, electrostatic, noncovalent, hydrophobic, hydrophilic, 
noncovalent, coordination, adsorbed, absorbed, polar, etc. When fibers or hollow filaments 

1 5 are utilized for the array, the probes can fill the hollow orifice, be absorbed into the solid 
filament, be attached to the surface of the orifice, etc. Probes can be of any eflFective size, 
sequence identity, composition, etc., as already discussed. 



Transgenic animals 

20 The present invention also relates to transgenic animals comprising polynucleotides 

genes, and homologs thereof. (Methods of making transgenic animals, and associated 
recombinant technology, can be accomplished conventionally, e.g., as described in 
Transgenic Animal Technology, Pinkext et al., 2°"^ Edition, Academic Press, 2002.) Such 
genes, as discussed in more detail below, include, but are not limited to, fimctionally- 

25 disrupted genes, mutated genes, ectopically or selectively-expressed genes, indudble or 

regulatable genes, etc. These transgenic animals can be produced according to any suitable 
technique or method, including homologous recombination, mutagenesis (e.g., ENU, 
Rathkolb et al., Exp. Physiol, 85(6):635-644, 2000), and the tetracycline-regulated gene 
expression system (e.g., U^S. Pat. No, 6,242,667). The term "gene" as used herein includes 

30 any part of a gene, i.e., regulatory sequences, promoters, enhancers, exons, introns, coding 
sequences, etc. The polynucleotides nucleic add present in the construct or transgene can be 
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naturally-'Occuiring wild-type, polymorphic, or mutated. Where the animal is a non-human 
animal, its homolog can be used instead. Transgenic animals can be susceptible to any of the 
dieases and disorders mentioned herein, e.g., as desoribed more particularly under the 
descriptions of each gene. 
5 Along these lines, polynucleotides of the present invention can be used to create 

transgenic animals, e.g. a non-human animal, comprising at least one cell whose genome 
comprises a functional disruption of a polynucleotide of the present invention, or a homolog 
thereof (e.g., a mouse homolog when a mouse is used). By the phrases 'Afunctional 
disraption" or ^functionally disrupted," it is meant that the gene does not express a 
10 biologically-active product. It can be substantially deficient in at least one functional activity 
coded for by the gene. Expression of a polypeptide can be substantially absent, i.e., 
essentially undetectable amounts are made. However, polypeptide can also be made, but 
which is deficient in activity, e.g., where only an amino-terminal portion of the gene product 
is produced. 

1 5 The transgenic atiimal can comprise one or more cells. When substantially all its 

cells contain the engineered gene, it can be referred to as a transgenic animal ^'whose genome 
comprises" the engineered gene. This indicates that the endogenous gene loci of the animal 
has been modified and substantially all cells contain such modification. 

Functional disruption of the gene can be accomplished in any effective way, 

20 including, e.g., introduction of a stop codon into any part of the coding sequence such that the 
resulting polypeptide is biologically inactive (e.g., because it lacks a catalytic domain, a 
ligand binding domain, etc.), introduction of a mutation into a promoter or other regulatory 
sequence that is effective to turn it off, or reduce transcription of the gene, insertion of an 
exogenous sequence into the gene which inactivates it (e.g., which disrupts the production of 

25 a biologicaUy-active polypeptide or which disrupts the promoter or other transcriptional 
machinery), deletion of sequences firom the gene (or homolog thereof), etc. Examples of 
transgenic animals having functionally disrupted genes are well known, e.g., as described in 
U.S. Pat Nos. 6,239,326, 6,225,525, 6,207,878, 6,194,633, 6,187,992, 6,180,849, 6,177,610, 
6,100,445, 6,087,555, 6,080,910, 6,069,297, 6,060,642, 6,028,244, 6,013,858, 5,981,830, 

30 5,866,760, 5,859,314, 5,850,004, 5,817,912, 5,789,654, 5,777,195, and 5,569,824. A 
transgenic animal which comprises the functional dismption can also be referred to as a 
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'Tmock-ouf ' animal, since the biological activity has been **knocked-out" Knock-outs can be 
homozygoTis orheterozygovis. 

For creating functionally disrupted genes, and other gene mutations, homologous 
recombination technology is of special interest since it allows specific regions of the genome 
5 to be targeted. Using homologous recombination methods, genes can be specifically- 
inactivated, specific mutations can be introduced, and exogenous sequences can be 
introduced at specific sites. These methods are well known in the art, e.g., as described in the 
patents above. See, also, Robertson, Biol Reproduc, 44(2):23 8-245, 1991. Generally, the 
gmetic engineering is performed in an embryonic stem (ES) cell, or other pluripotent cell line 

10 (e.g., adult stem cells, EG cells), and that genetically-modified cell (or nucleus) is used to 
create a whole organism. Nuclear transfer can be used in combination with homologous 
recombination technologies. 

For example, the polynucleotides locus can be disrupted in mouse ES cells using a 
positive-negative selection method (e.g., Mansour et al.. Nature^ 336:348-352, 1988). hi this 

15 method, a targeting vector can be constructed which comprises a part of the gene to be 
targeted. A selectable marker, such as neomycin resistance genes, can be inserted into a 
polynucleotides exon present in flie targeting vector, disrupting it When the vector 
recombines with the ES cell genome, it disrupts the function of flie gene. The presence in the 
cell of the vector can be detemiined by expression of neomycin resistance. See, e.g., U.S. 

20 Pat No. 6,239,326. Cells having at least one functionally disrupted gene can be used to 
make chimeric and gerailine animals, e.g., animals having somatic and/or germ cells 
comprising the engineered gene. Homozygous knock-out animals can be obtained from 
breeding heterozygous knock-out animals. See, e.g., U.S. Pat. No. 6,225,525. 

The present invention also relates to non-human, transgenic animal whose genome 

25 comprises recombinant polynucleotides nucleic acid (and homologs thereof) operatively 

linked to an expression control sequence effective to express said codiag sequence. Such a 
transgenic animal can also be referred to as a 'Imock-in" animal since an exogenous gene has 
been introduced, stably, into its genome. 

A recombinant nucleic acid refers to a polynucleotide which has been introduced into 

30 a target host cell and optionally modified, such as cells derived from animals, plants, bacteria, 
yeast, etc. A recombiuant nucleic acid includes completely synthetic nucleic add sequences. 
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semi-syatiietic nucleic acid sequences, sequences derived fiom natural sources, and chimeras 
thereof "Op^able linkage" has the meaning xised through the specification, i.e., placed in a 
functional relationship with another nucleic acid. When a gene is operably linked to an 
expression control sequence, as explained above, it indicates that the gene (e.g., coding 
5 sequence) is joined to the expression control sequence (e.g., promote) in such a way that 
facilitates transcription and translation of the coding sequence. As described above, the 
phrase "genome" indicates that the genome of the cell has been modified. In this case, the 
recombinant polynucleotide has been stably integrated into the genome of the animal. The 
nucleic acid (e.g., a coding sequence) in operable linkage with the expression control 

1 0 sequence can also be referred to as a construct or transgene. 

Any expression control sequence can be used depending on the purpose. For 
instance, if selective expression is desired, then expression control sequences which limit its 
expression can be selected. These include, e.g., tissue or cell-specific promoters, introns, 
enhancers, etc. For various methods of cell and tissue-specific expression, see, e.g., U.S. Pat. 

15 Nos. 6,215,040, 6,210,736, and 6,153,427. These also include the endogenous promoter, i.e., 
the coding sequence can be operably linked to its own promoter. Mducible and regulatable 
promoters can also be utilized. 

The present invention also relates to a transgenic animal which contains a fimctionally 
disrupted and a transgene stably integrated into the animal's genome. Such an animal can be 

20 constructed using combinations any of the above- and below-mentioned methods. Such 
animals have any of the aforementioned uses, including permitting the knock-out of the 
normal gene and its replacement with a mutated gene. Such a transgene can be integrated at 
the endogenous gene locus so that the fimctional disruption and **knock-in" are carried out in 
the same step. 

25 In addition to the methods mentioned above, transgenic animals can be prepared 

according to known methods, including, e.g., by pronuclear injection of recombinant genes 
into pronuclei of 1-cell embryos, incorporating an artificial yeast chromosome into 
embryonic stem cells, gene targeting methods, embryonic stem cell methodology, cloning 
methods, nuclear transfer methods. See, also, e.g., U.S. Patent Nos. 4,736,866; 4,873,191; 

30 4,873,316; 5,082,779; 5,304,489; 5,174,986; 5,175,384; 5,175,385; 5,221,778; Gordon et al., 
Proc. Natl. Acad. Sd., 77:7380-7384, 1980; Pahniter et al., CeU, 41:343-345, 1985; Pakniter 
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et al., Ann. Rev. Genet, 20:465-499, 1986; Askew et al., Mol. CeU. Bio., 13:4115-4124, 
1993; Games et al. Nature, 373:523-527, 1995; Valancius and Smithies, MoL CeU. Bio.. 
1 1:1402-1408, 1991; Stacey et al., Mol. CeU. Bio., 14:1009-1016, 1994; Hasty et al.. Nature, 
350:243-246, 1995; Rubinstein et al., Nucl. Acid Res., 21:2613-2617,1993; Cibelli et al., 
5 Scirace, 280:1256-1258, 1998. For guidance on recombinase excision systems, see, e.g., 
U.S. Pat. Nos. 5,626,159, 5,527,695, and 5,434,066. See also, Orban, P.C., et al., 'Tissue- 
and Site-Specific DNA Recombination in Transgenic Mice," Proc. Natl. Acad. Sci. USA, 
89:6861-6865 (1992); O'Gonnan, S., et al., ^Tlecombinase-Mediated Gene Activation and 
Site-Specific Integration in Manomalian Cells," Science, 251 :1351-1355 (1991); Sauer, B., et 

1 0 al., "Cre-stimulated recombination at loxP-Containing DNA sequences placed into the 

mammalian genome," Polynucleotides Research, 17(1): 147-161 (1989); Gagneten, S. et al. 
(1997) Nucl. Acids Res. 25:3326-3331; Xiao and Weaver (1997) Nucl. Acids Res. 25:2985- 
2991; Agah, R. et al. (1997) J. Clin. Invest 100:169-179; Barlow, C. et al. (1997) Nucl. 
Acids Res. 25:2543-2545; Araki, K. et al. (1997) Nucl. Acids Res. 25:868-872; Mortensen, 

15 R. N. et al. (1992) MoL Cell. Biol. 12:2391-2395 (G418 escalation method); Lakhlani, P. P. 
et al. (1997) Proc. Natl. Acad. Sci. USA 94:9950-9955 ("hit and run"); Westphal and Leder 
(1997) Curr. Biol. 7:530-533 (transposon-generated *1aiock-ouf' and "knock-in'O; 
Templeton, N. S. et al. (1997) Gene Ther. 4:700-709 (methods for eflBcient gene targeting, 
allowing for a high firequency of homologous recombination events, e.g., without selectable 

20 markers); PCT Intemational Publication WO 93/22443 (fimctionally-disrupted). 

A polynucleotide according to the present invention can be introduced into any 
non-human animal, including a non-human mammal, mouse (Hogan et al.. Manipulating the 
Mouse Embryo: A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York, 1986), pig (Hammer et al.. Nature, 315:343-345, 1985), sheep (Hammer et al., 

25 Nature, 315:343-345, 1985), cattle, rat, or primate. See also, e.g.. Church, 1987, Trends in 
Biotech. 5:13-19; Clark et al.. Trends in Biotech. 5:20-24, 1987); and DePamphilis et al., 
BioTechniques, 6:662-680, 1988. Transgenic animals can be produced by the methods 
described in U.S. Pat. No. 5,994,618, and utilized for any of the utilities described therein. 

30 Database 

The present invention also relates to electronic forms of polynucleotides. 
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pol^^eptides, etc., of the present invmtion, including computer-readable medium (e.g., 
magnetic, optical, etc., stored in any suitable format, such as flat files or hierarchical files) 
which comprise such sequmces, or fi:agments thereof e-commerce-related means, etc. 
Along these Unes, the present invention relates to methods of retrieving gcae sequences fit)m 
5 a computer-readable medium, comprising, one or more of the following steps in any effective 
order, e.g., selecting a cell or gene expression profile, e.g., a profile that specifies that said 
gene is expressed in a particular (see the expression profiles described above), and retrieving 
said differentially expressed gene sequences, where the gene sequences consist of the genes 
represented by SEQ ID NOS 1,11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, or the 

10 polypeptides encoded thereby. 

A "gene expression profile" means the list of tissues, cells, etc., in which a defined 
gene is expressed (i.e, transcribed and/or translated). A "cell expression profile" means the 
genes which are expressed in the particular cell type. The profile can be a list of the tissues 
in which the gene is expressed, but can include additional information as well, including 

1 5 level of expression (e.g., a quantity as compared or normalized to a control gene), and 

information on temporal (e.g., at what point in the cell-cycle or developmmtal program) and 
spatial expression. By the phrase "selecting a gene or cell expression profile," it is meant that 
a user decides what type of gene or cell expression pattern he is interested in retrieving. Any 
pattem of expression preferences may be selected. The selecting can be performed by any 

20 effective method. In general, "selecting" refers to the process in which a user forms a query 
that is used to search a database of gene expression profiles. The step of retrieving involves 
searching for results in a database that correspond to the query set forth in the selecting step. 
Any suitable algorithm can be utilized to perform the search query, including algorithms that 
look for matches, or that perform optimization between query and data. The database is 

25 information that has been stored in an appropriate storage medium, having a suitable 

computer-readable format. Once results are retrieved, they can be displayed in any suitable 
format, such as HTML A query is formed by the user to retrieve the set of genes firom the 
database having the desired property. Once the query is inputted into the system, a search 
algorithm is used to interrogate the database, and retrieve results. 

30 

Advertising, licensing, etc., methods 
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The present invention also relates to me&ods of advertising, licensing, selling, 
purchasing, brokering, etc., genes, polynucleotides, specific-binding partners, antibodies, etc., 
of the present invention. Methods can comprises, e.g., displaying a gene, polynucleotide, 
polypeptide, or antibody specific for a polypeptide in a printed or computer-readable medium 
5 (e.g., on the Web or Intemet), accepting an offer to purchase said gene, polypeptide, or 
antibody, etc. 



Other 

A polynucleotide, probe, polypeptide, antibody, specific-binding partner, etc., 

1 0 according to flie present invention can be isolated. The term "isolated" means that the 
material is in a form in which it is not found in its original environment or in nature, e.g., 
more concentrated, more purified, separated from component, etc. An isolated 
polynucleotide includes, e.g., a polynucleotide having flie sequenced separated from the 
chromosomal DNA found in a living animal, e.g., as the complete gene, a transcript, or a 

1 5 cDNA. This polynucleotide can be part of a vector or inserted into a chromosome (by 

specific gene-targeting or by random integration at a position other than its normal position) 
and still be isolated in that it is not in a form that is found in its natural environment. A 
polynucleotide, polypeptide, etc., of the present invention can also be substantially purified. 
By substantially purified, it is meant that polynucleotide or polypeptide is separated and is 

20 essentially free from other polynucleotides or polypeptides, i.e., the polynucleotide or 
polypeptide is the primary and active constituent. A polynucleotide can also be a 
recombinant molecule. By "recombinant," it is meant that the polynucleotide is an 
arrangement or form which does not occur in nature. For instance, a recombinant molecule 
comprising a promoter sequmce would not encompass the naturally-occurring gene, but 

25 would include the promoter operably linked to a coding sequence not associated with it in 
nature, e.g., a reporter gene, or a truncation of the normal coding sequence. 

The term '"marker" is used herein to indicate a means for detecting or labeling a 
target. A marker can be a polynucleotide (usually referred to as a **probe"), polypeptide (e.g., 
an antibody conjugated to a detectable label), PNA, or any effective material. 
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The topic headings forth above are meant as guidance where certain information 
can be found in flie application, but are not intended to be the only source in the application 
where information on such topic can be found. Reference materials 

For other aspects of the polynucleotides, reference is made to standard textbooks of 
5 molecular biology. See, e.g., Hames et al.. Polynucleotide Hybridization. IL Press, 1985; 
Davis et al., Basic Methods in Molecular Biology. Elsevir Sciences Publishing, Inc., New 
York, 1986; Sambrook et al.. Molecular Cloning, CSH Press, 1989; Howe, Gene Cloning 
and Manipulation, Cambridge University Press, 1995; Ausubel et al.. Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc., 1994-1998. 

10 Without further elaboration, it is beUeved that one skilled ia the art can, using tiie 

preceding description, utilize the present invention to its fullest extent. The following 
preferred specijBc embodiments are, therefore, to be construed as merely illustrative, and not 
limitative of the remainder of the disclosure in any way whatsoever. The entire disclosure of 
all applications, patents and publications, dted above and in the figures are hereby 

1 5 incorporated by reference in their entirety, including U.S. Serial No. 1 0/094,092, filed March 
1 1, 2002, U.S. Serial No. 10/1 12,372, filed April 1, 2002, U.S. Serial No. 60/382, 614, filed 
May 24, 2002, U.S. Serial No. 10/164,717, filed June 10, 2002, U.S. Serial No. 10/167,631, 
filed June 13, 2002, U.S. Serial No. 10/177,917, filed June 24, 2002, and U.S. Serial No. 
60/399,125, filed 30 July 2002, which are hereby incorporated by reference in their entirety. 



20 
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TABLE 1 
Variant" Nucleotidechangd 



ProlSAIa 

86insA 

VaieOLeu 

Ala64Ser . 

Aig67Gli). 

Mie7$Tyr. 

Asp84Glu 

AlaSlPro 

Val92Met 

Thi95Met 

Val97Ile 

AlalOSyal 

Glyi04Ser 

Leul06Gln 

Leul06LeQ 

Argl42His 

ArglSlCys 

ArglSlAig 

flel55Thr 

ArgieOTip 

Aigl63GIn: 

Yall73del 

Vall74ne 

537insC 

Prt>230L6u 

Pro230PW) 

Gln233Gln 

His260Pro 

I]e264Ile 

Cys273Cys 

Lys278Glu 

Asn279Ser 

Asn279Lys 

Ile287Met 

Asp294His 

Pb6306l>he 

Thr314Thr 

'Set316Ser 



520G 

178G>T 
1900 >T 
..200G>A. 
2271* > A 
252C > A 
241G>C 
.274G>A 
284C>T 
289G>A 
308C > T 
310G>A- 
317T>A 
318G>A 
425G>A 
451C>T 
4530G 
464T>C 
478C > T 
488G>A 

520G>A 

6890T 

690G>A 

699G>A 

119 A > C. 

792G5>T 

819C>T 

832A>G 

836A>G 

837C > A 

861C.>G 

880Q>C 

mc>r 

942A>G 
9480T 
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TABLE2 



Allele 


Allele ReqDeiic7,% 

* Wbfte ifflnvfdoale ' 
PopalaQoiK WIfliReilHarr 


l^fitiitafliin 

VWUIUHlilwSI 

of GMIP 
Proibidlon 


VvHa TjfPB 


53 


23 


4++ 


ValoOLeut ' 


10 


3 


+ 


AlaD4Ser 


<1 


1 


NA 


Ly$65A$n 


<1 


<1 


NA 


Arg67Gln 


0| 


0 


NA 


ArgoTVal 




0 


NA 


Phc76Tyr 


<1 


<1 


NA 


A$p&4GlU 


1 


3 


+++ 


A^_rVf A 

/ten91Asp 


<1 


0 


NA 


Val92Leu 


<1 


1 


NA 


vaiyZMet 


8 


8 


-l-H- 


TnrsoiviQt 


<1 


1 


NA 


t#-|A7||. 


.<1 


<1 


NA 




<1 


<1 


Al M 

NA 


L&UIUOuin 


<1 


<1 


Al A 

NA 




<1 


1 




fU9 ivll^Ss; 


6 


25 


mm 


llelSSThr 


<1 


<1 


NA 




7 


10 




Afig163Gln 


411 


<1 


NA 


lle287Met 




0 


NA 




4 


13 




Ala299Thr 


<1 


<1 


NA 


ias291l 


<1 


<1 




in$1701| 


<1 


<1 





^Several anonymous variants have also been des(af bed, Including 
Uuioeuu. LeulSSUu. e)n233Gln, Qy82730ys. PheSOOPhe. Thr314Thr. and 
S6i916Sen MC1-R Indicates melanocortln-1 recepton cMIlP, c^lc 
adenosine monopliosphale; triple plus sign, ^gnfflcant stimuiatlon (^e as 
wild type); single plus s^n. minimal ^mulaflon; NA. data notavaHabie; and 
minus sign, no stfmolation (nonfunctional receptor), 

fPossIble association with blond/y r haic 

^Present in <1% of East/Southeast Asians. 

§Stn)ng assodaSon with red hair, falrsldn, and poortanning abil%; recent 
woik also shows an association with cutaneous melanoma and 
nonmelanoma ^n cancer. 

{Present in >70% of East/Southeast Asian and Native Americans. 

f ns Indicates insertion; these single-nudeotida Insertion mutalions 
_ JUlKiuiSJ&aDyssiilft&lhstiestdLiiLa orem^ureh/ terminated, nonf undioninn 
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Claims: 

1. Aa isolated polynucleotide comprising, 

a polynucleotide sequence which codes without intenruption for an amino acid 
sequence set forth in SEQ ID NO 2, 12, 17, 26, 39, 44, 46, 48, 50, 52, or 59, or a complement 
5 thereto. 

2. An isolated polynucleotide of claim 1 , comprising a polynucleotide sequence set forth in 
SEQ ID NO 1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, or 58, or a complement thereto. 

10 3. An isolated polynucleotide comprising, 

a polynucleotide sequence having 95% or more sequence identity along the entire 
length of tiie polynucleotide sequence set forth in SEQ ID NO 1, 1 1, 16, 25, 38, 43, 45, 47, 
49, 51, or 58 of claim 1, or a complement tiiereto. 

15 4. An isolated polynucleotide comprising, 

a human polynucleotide sequence which hybridizes under high stringency conditions 
to a polynucleotide having a polynucleotide sequence set forth in SEQ ID NO 1, 1 1, 16, 25, 
38, 43, 45, 47, 49, 51, or 58 of claim 1, or a complement thereto. 

20 5. An isolated polynucleotide of claim 4, wherein said high stringency conditions comprise 
hybridizing 42°C in 5X SSPE, 0.3% SDS, and 50% formamide, and washes at 65°C for 15 
minutes in 2X SSC, and 0.2% SDS. 

6. An isolated polypeptide comprising, 
25 the amino acid sequence set forth in SEQ ID NOS 2, 12, 17, 26, 39, 44, 46, 48, 50, 

52, or 59. 



30 



7. An isolated polypeptide comprising, 

an amino acid sequence having 95% or more sequence identity along the entire length 
of the amino acid sequence of claim 6. 
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8- An isolated polypeptide which is coded for by a polynucleotide of claim 4. 

9. A method of detecting a nucleic add coding, comprising, 

contacting a sample comprising nucleic acid with a polynucleotide probe specific for 
5 a hxmian muscle selective polynucleotide of claim 1 under conditions effective for said probe 
to hybridize specifically with said polynucleotide, and 

detecting hybridization between said probe and said nucleic acid. 

1 0. A method of claim 9, wherein said detecting is performed by: 

10 Northem blot analysis, polymerase chain reaction (PGR), reverse transcriptase PGR, 

RAGE PGR, or in situ hybridization. 

1 1 . A method of diagnosing a disease associated with abnormal expression of a gene in a 
subject, or determining a subject's susceptibility to such disease, comprising: 

1 5 assessmg the expression of said gene in said subject. 

1 2. A method of claim 1 1 , wherein assessing is: 

measuring expression levels of said gene, determiiiing the genomic structure of said 
gene, determining the mRNA structure of transcripts from said gene, or measuring ttie 
20 expression levels of polypeptide coded for by said gene, and 

13. A method of claim 1 1, wherein said assessing is performed by: 

Northem blot analysis, polymerase chain reaction (PGR), reverse transcriptase PGR, 
RAGE PGR, or in situ hybridization, and 
25 using a polynucleotide probe having a polynucleotide sequence selected from SEQ ID 

NO 1,11, 16, 25, 38, 43, 45, 47, 49, 51, or 58, or a complement fliereto. 

14. A method for identifying an agent that modulates the expression of a gene in a cell, 
comprising, 

30 contacting a cell population with a test agent under conditions effective for said test 

agent to modulate the expression of a polynucleotide of claim 1 in said cells, and 
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detennining whether said test agent modulates said polynucleotide. 

1 5. A method for identifying an agent that modulates the expression of a polypeptide coded 
for a gene, comprising, 

5 contactmg a polypeptide coded for by a polynucleotide of claim 1 , with a test agent 

under conditions effective for said test agent to modulate said polypeptide, and 
determining whether said test agent modulates said polypeptide. 

16. A method of claim 15, wherein said test agent is an antibody. 

10 

17. A method of identifying a genetic basis for a disease or disease-susceptibility, 
comprising: 

determining the association of a disease or disease-susceptibiUty with a 
polynucleotide of claim 1 . 

15 

18. A method of claim 17, wherein determining is performed by producing a human-linkage 
m^ using said polynucleotide. 

19. A method of claim 17, wherein determining is performed by comparing the nucleotide 
20 sequences of said polynucleotide between normal subjects and subjects having a muscle 

disease. 

20. A non-human, transgenic mammal, or a cell thereof, whose genome comprises a 
functional dismption of a homolog of a gene of claim 1. 

25 

21. A method of advertising genes for sale, conunercial use, or licensing, comprising, 

displaying in a computer-readable medium a polynucleotide set forth in SEQ ID NO 
1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, or 58 of claim 1, or complements thereto. 



30 



22. A method of selecting a polynucleotide sequence coding for a polypeptide, or a 
polypeptide sequence thereof from a database comprising polynucleotide sequences and/or 
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polypeptide sequences, comprising 

displaying, in a computer-readable medium, a polynucleotide sequence of claim 1, or 
polypeptide ^coded thereby, or complements to the polynucleotides sequence, 

wherein said displayed sequences have been retrieved from said database upon 
5 selection by a user. 

23. An antibody which is specific for a polypeptide of claim 6, 7, or 8. 
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* 380 * 400 * 420 
OrB860 : "EQRSNLNPGDHEDTRNAI.PPRQDGEVTTGKYATNLAESVLQDAFXRLSQSQSTLPQESAV : 420 
KXAA167B : DHEDTRNALPPRQDGEVTTGKYAITNLAE SVLQDAFXRLSQSQSTLPQESAV : 51 

* 440 * 460 * 480 
OTB860 : SVSVGSSLLPSCYSTKDa?WSRSliJNELPKIVWQSPDGSDAAPQPGISSWPEMEVSVETS : 480 
KIAA1678 : SVSVGSSLLPSCYSTKDTWSRSI3NELPKIVWQSPDGSDAAPQPGXSSWPEMEVSVETS : 111 

* 500 * 520 * 540 
OTB860 : SILSGENSSRQPQSALEVALACAATVIGTISSPQATERLKMEQWSNFPPGSSGALQTQA : 540 
KXAA1678 : SXLSGENSSRQPQSAIiEVALACAATVIGTISSPQATERLKMEQWSNFPPGSSGALQTQA : 171 



* 560 * 580 * 600 

OTB860 : PQGLKE PSINEYSF PSALCGMTQVASAVAVCGLGEREEVTCSVAPSGSLPPAAEASEAMP : 600 
KIAA167B : PQGLKEPSXNEYSPPSALCQ1TQVASAVAVCGLGEREEVTCSVAPSGSLPPAAEASEAMP : 231 
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* 620 * 64 0 * 660 
OTB860 : PLCGLASMBLGKEAXAEGLLKEAAXiVIiTRFNTYSSlGDFLDSHNRRlHETASKSQTIiCSE : 660 
K1AA1678 : PLCGLASHEIjGKEAXAEGLI.KBAAI*VI.TRFNTYSSIGDFLDSHNRRXME!FASKSQTLCSE : 291 

* 680 * 700 * 720 
OTB860 : NWRNEXiAHTLSNVXLRHSXDBVHHKNMXXDPNDNRHSSEXLDTIJ9ESTNQLLI.DVlCFT : 720 
KXAA1678 : NWRNEIAHTLSNVXLRHSXDEVHHKNMXXDSNDNRHBSEXLDTIiMESTNQLIiLDVXCFT : 351 

* 740 * 760 * 780 
OTB860 : FKKMSHXVRLGBCPAVIjSKETXRRRETEPSCQFSDPGASQAj0TKATESSSSSPZ.SNSHNT : 780 
KXAA1678 : FKKHSHXVRLGECPAVIjSKETXRRRETEPSCQPSDPGASQAaTKATESSSSSPIiSNSHNT : 411 

* 800 * 820 * 840 
OTB860 : SLVXNNIiVDGMYSKQDKGGVRPGLFKNPTLQSQLSRSHRVPDSSTATTSSKEIYLKGIAG : 840 
KIAA1678 : SLVXNNLVDC^fYSKQDKGGVRPGLFKNPTLQSQLSRSHRVPDSSTATTSSKEIYLKGIAG : 471 

* 860 * 880 * 900 
OTB860 : EDTKS PHHSENE CRASS EGQRSPTVSRSRSGSQBAEESXHPNTQEKYNCATSRINEVQVN : 900 
KIAA1678 : EDTKSPQHSENECRASSEGQRSPTVSQ'SRSGSQEAEESIHPNTQEKYNCATSRINEVQVN : 531 

* 920 * 940 * 960 
OTB860 : LSLIiGDDLIiLPAQSTLQTKHPDXYCITDPAEBLADTWSMATEIAAICLDNSSGKQPWPC : 960 
KIAA1678 : LSLLGDDLLLPAQSTLQTKHPDIYCXTDFAEELADTWSMATEIAAICLDNSSGKQPWFC : 591 

* 980 * 1000 * 1020 
OTB860 : AKTKRGSEPIiMTPNVPCRSLKRKKESQGSGTAVRKHKPPRLSBIKRKTDEHPELKEKIiMNR : 1020 
KIAA1678 : AWKRGSEFLMTPNVPCRSLKRKKBSQGSGTAVRKHKPPRLSEIKRKTDEHPELKEKIiMNR : 651 

* 1040 * 1060 * 1080 
OTB860 : WDBSMNLEDVPDSVNLFANEVAAKIMNLTEFSMVDGMWQAQGYPRNRLLSGDRWSRLKA : 1080 
KIAA1678 : VVBESMNLBDVPDSWLPANEVAAKimiiTEPSlWDGMWQAQGYPRNRLLSGDRWSRLKA : 711 

* 1100 * 1120 * 1140 
OTB86D : SSCESIPEEDSEARAYVNSLGLMSTLSQPVSRASSVSKQSSCESITDEFSRPMVNQMENE : 1140 
KIAA1678 : SSCESIPEEDSEARAYVNSLGLMSTLSQPVSRASSVSKQSSCESITDEFSRFMVNQMENE : 771 



* 1160 * 1180 * 1200 

OTB860 : GRGFELLLDYYAGKNASSILNSAMQQACRKSDHIiSVRPSCPSKQSSTESITEEFYRYMLR : 1200 
KIAA1678 : GRGFELLLDYYAGKNASSILNSAMQQACRKSDHIiSVRPSCPSKQSSTESITEEFYRYMLR : 831 
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* 1220 * 1240 * 1260 
OTB860 : DlBRDSRESASSRRSSQDRTAGLIiSPSLRSPVCHRQSSHPDSRSPCSRIiTVNVPIKANSL : 1260 
KXAA1678 : DJERDSRESASSRRSSQD0TAGLLSPSLRSPVCHRQSSHPDSRSPCSRLTVNVPIKANSX. : 891 

* 1280 * 1300 * 1320 
OTB860 : DGFAQNCPQDFLSVQPVSSASSSGLCKSDSCLYRRGGTDHITNMLIHETWASSIEALMRK : 1320 
KXAA167 8 : DGFAQNCPQDFLSVQPVSSASSSGLCKSDSCXiYRRGGTDHlTNHLlHETHASSlEAIiMRK : 951 

* 1340 * 1360 * 1380 
OTB860 : NKXXVDDAEEADTEPVSGGS PSQAEKCANRLAASRMCSGPTIiLVQESLDCPRKDSVTECK : 1380 
KXAA1678 : NKXXVDDAEEADTEPVSGGSPSQAEKCANRLAASRHCSGPTItliVQESIiDCPRKDSVTECK : 1011 

* 1400 * 1420 * 1440 
OTB860 : QPPVSSXjSKTASLTNHSPLDSKKETSSCQDPVPXNHKRRSLCSREVPLXQXEO^DQRBACA : 1440 
KXAA167e : QPPVSSXiSKTASLTNHSPLDSKKETSSCQDPVPXNHKRRSLCSREVPLXQXETDQREACA : 1071 

* 1460 * 1480 * 1500 
OTB860 : GEPE PFLSKSSLLEEAEGHSNDKNX PDWRGGDTAVSACQIHSDSLDTRDVPEAEASTEA : 1500 
KIAA167B : GEPE PFLSKSSLLEEAEGHSNDKNX PDWRGGDTAVSACQIHSDSLDTRDVPEAEASTEA : 1131 

* 1520 * 1540 * 1560 
OTB860 : RAPDEAPNPPSSSEESTGSWTQLANEEDNPDDTSSFLQLSERSMSNC^SSATSSLGXMDL : 1560 
KXAA167B : RAPDEAPNPPSSSEESTGSWTQLANEEDNPDDTSSFLQLSERSMS — ' : 1176 

* 1580 * 1600 * 1620 

OTB8 60 : pXYQESMPSSPMXNELVEEKKILKGQSESTEAPASGPPTGTASPQRSLLVINFDLEPECP : 1620 

KIAA1678 : ELVEEKKILKGQSESTEAPASGPPTGTASPQRSLLVXNFDLBPECP : 1222 

* 1640 * 1660 * 1680 
OTB860 : DAELRATLQWIAASELGIPTiyFKKSQENRXEKPLDWQLVHRKSiaKVGDXEHAWQYCK : 1680 
KIAA167B : DAELRArLQarXAASELGIPTIYPKKSQENRXEKELDWQLVHRKSWKVGDXFHAWQyCK : 1282 



* 1700 
OTB860 : MHEEQKDGRLSLFDHLLELG— 
KXAA1678 : MHEEQKDGRLSLFDWLLELG — 
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a taCC t tCaa 


4216 


aat^atclicra 


aagttag'bta 


'tgaatit'baga 


cccttggacc 


a 4* 4* ^ #^ a a !■ 

auttcagagt 


f^f^^ a /v#va ^ a ^ 

gccaggatab 


421S 




tggcgggaga 


dtctiggccct 


ggaacca'tcc 


^ ^ /«4» /*4* 
Ct uuCkCtCb 


ctgca'tc'tgg 


9 J J D 


cacd'kccaac 


accatga'tgg 


ggcttacaca 


cat^gcat^at^a 


gacccckcag 


cctgaacatc 






ccaaatagcti 


sccct^tggac 


cl^agaagt^gt 


#v#va#^a^4*4*Ma 

ggacact^ga 


agcatggbCt 


«49 D 




naa^/^^a4* 

gaa ugga ucc 


a 4* ^^a 

a tgga ugagg 


4* /^-H a ^ a ^a rta 
uCuaCctgoga 


^^4'rT^a a a ^ ^ 

cc: tgga.aagt 


a ^ a nna n 

ggcat aggag 


4516 




a fra ^a a 4* ^ ♦* ^ 

agogaa bb u u 


^^na 4* ^a ^ 

gggabCkcag 


gucicccagag 


a n^ rr^^ 4* a a n 

agtggttaag 


gcgggccigca 


457 6 




ggugcc^ucu 


tggtccccca 


ga't t 't etc a t 


cccatiaggga 


'tggt^'katzc^c 


H DO D 




ccagggcagg 


ggtzccckccc 


atccgcagct 


a a a 4* fw^^ a 

aagcaxggta 


Cbgcbggcag 


A Ma 


a^/v^/*a #v/^a 4* 


^ a a a ^ a a 


^gug^caccc 


cagccacaac 


aggtctgaat 


att'taaagaa 


H lOXi 


acccccc^rcc 


atcc t gc t t^c 


taaacgca'kg 


cccgccl^tzcc 


caagtgctgg 


1 1 cca'tcg 




cctcctttac 


tccttcattg 


gtcctagatc 


agccctagac 


agggttctcg 


gggcccctct 


4876 


ttattcccag 


tcaccccaaa 


gcccaatttc 


caagcccctt 


ccaaagccct 


ctgttgcaag 


4936 


cgcatcctcc 


ctccctcgtg 


ccccctcaat 


attcacacct 


aggtagtaga 


tgcaatactc 


4996 


tagctgccac 


tggcatgttc 


cacaagggrtt: 


tcctgctcca 


acttgggccc 


taagggattg 


5056 


attaggttgg 


ctaggttagg 


-bcccctttat 


gacaaaacca 


aaacagaatt 


gatgccccca 


5116 


cccgtcaagg 


gcacttaaaa 


aaccaaaact 


caaagtbcag 


ggccaagttt 


gaggatgtgc 


5176 


agaaaggtgt 


gtgttttttc 


ttgtcaattg 


cgactctaat 


gatggactca 


cgttgcccgc 


5236 
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tcttcccttt ctcttacacc ttacctacct actaaaggag gagttcttgc ttggtaagtg 5296 

gatataatcc gcaaagacat gagagaattt attagaagcc actcaagagc cttagctacc 5356 

ttctacaagg ggaaaaggac acacacaaat atctatagtg accctttttt tcgtttattt 5416 

ttggtcagac tgtttaactt ccattttttt tgtcccctcc tttcttttcc cctttagttg 5476 

aaaactgcta aaatgtcagt ttgcgacctt gactttatat ttaaaaaaaa aaaaaaaaaa 5536 

<210> 2 

<211> 135 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Ala Asp Phe Gin Leu Pro Asp Ser He Leu Trp Cys Gin Asn Leu 
15 10 15 

Phe Ser Leu Pro Val Trp Gin Ser Gly His Thr Arg Glu Leu Met Leu 
20 25 30 

Gin Cys Ser Ala Leu Ser Asp Gly Glu Leu Val Tyr Lys Tyr Pro Ser 
35 40 45 

Ser Leu Thr Pro Trp Leu Arg Leu Thr Leu Gly Cys Met Phe Tyr Thr 
50 55 60 

Gly Ser Gin Gly Val Leu Thr Glu He Lys His Pro Leu Pro Thr Val 
65 70 75 80 

He Ala Gly Leu He Met His Leu Leu Leu ser Pro Ser Leu Leu Cys 
85 90 95 

He He Ser Thr Leu His Tyr Arg Gly Ser Leu Pro Ser Gin He He 
100 105 110 

Ser Thr His Phe Pro Ser Arg Leu Leu Arg He Gin He Arg Gin Ala 
115 120 125 

Leu Phe His Leu Ala He Lys 
130 135 

<210> 3 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ggagctaacg aaagcatgtc tgaaac 26 

<210> 4 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 4 

ggcaaggaac ctctgtaatg gagtg 25 

<210> 5 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 5 

tgataatgta tctaactggg tcccggtggg gatttctgag aacaggtggg 50 
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<210> 6 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 6 

aaaaacaata aataacagtg ccggcttccc tggagggctg agagagactc 



<210> 7 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 7 

aattaaattt taaaatgtct gagattgact accctgaccc agagagaaag 50 



<210> 8 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 8 

agaaagagtt tataataact ctgatgtctg gatatggttt aatggatccg 50 



<210> 9 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ccaagaggga ttaaaaaggt cgagatggga gagatggagc aatacacttc 50 



<210> 10 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 10 

gctcatgccc catctaaaag gggcagctgt aattgctctc agctgattct 50 



<210> 
<211> 
<212> 



11 

2270 
DNA 



<213> Homo sapiens 



<220> 
<221> 
<222> 
<223> 



CDS 

(157) . . (1080) 



<400> 11 

agacagcaga gaggctgccc tgctgcaatg tcaccgtcgt cactgcctct gcaggctgca 
ggcacctgcc actactgcag aggactgagg ggccttggcc cagcagggac cccagggcct 



tgggggactg tgtgagctgg aaacgtggct ggccag atg 

Met 
1 

ccc cct ggg ggt gcg tac ctg cac ctg ggc gcc 
Pro Pro Gly Gly Ala Tyr Leu His Leu Gly Ala 
10 15 

ggc aca gcc cgc gtg ctg cag ctg gcc ttt ggc 
Gly Thr Ala Arg Val Leu Gin Leu Ala Phe Gly 
25 30 

Ctg gtg get cac egg ggt ggc ttt gcg ggc gtc 
Leu Val Ala His Arg Gly Gly Phe Ala Gly Val 
40 45 



ggc age acc atg gag 
Gly Ser Thr Met Glu 
5 

gtg aca tec cct gtg 
Val Thr Ser Pro Val 
20 

tgc act acc ttc age 
Cys Thr Thr Phe Ser 
35 

cag ggc acc ttc tgc 
Gin Gly Thr Phe Cys 
50 



atg gcc gcc tgg ggc ttc tgc ttc gcc gtc tet gcg ctg gtg gtg gcc 
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Met Ala Ala Trp Gly Phe Cys Phe Ala Val Ser Ala Leu Val Val Ala 
55 60 65 70 

tgt gag ttc aca egg etc cac ggc tge etg egg etc tec tgg ggc aac 414 
Cys Glu Phe Thr Arg Leu His Gly Cys Leu Arg Leu Ser Trp Gly Asn 
75 80 85 

ttc acc gcc gcc ttc gcc atg ctg gcc acc ctg eta tgc gcg acg get 4 62 

Phe Thr Ala Ala Phe Ala Met Leu Ala Thr Leu Leu Cys Ala Thr Ala 
90 95 100 

gcg gtc etg tat ccg ctg tac ttt gcc egg egg gag tgt ccc ccc gag 510 
Ala Val Leu Tyr Pro Leu Tyr Phe Ala Arg Arg Glu Cys Pro Pro Glu 
105 110 lis 

ccc gcc ggc tgt get gcc agg gac ttc cgc ctg gca gcc agt gtc ttc 558 
Pro Ala Gly Cys Ala Ala Arg Asp Phe Arg Leu Ala Ala Ser Val Phe 
120 125 130 

gcc ggg etc etc ttc ctg gcc tac get gtg gag gtg gcc ctg acg egg 606 
Ala Gly Leu Leu Phe Leu Ala Tyr Ala Val Glu Val Ala Leu Thr Arg 
135 140 145 150 

gcc egg ccc ggc cag gtg age age tat atg gcc acg gtg teg ggg etc 654 
Ala Arg Pro Gly Gin Val Ser Ser Tyr Met Ala Thr Val Ser Gly Leu 

155 160 165 

etc aag ate gtc cag gee ttc gtg gcc tgc ate ate ttc ggg gcg ctg 702 
Leu Lys He Val Gin Ala Phe Val Ala Cys He He Phe Gly Ala Leu 
170 175 180 

gtc eat gac age cgc tac ggg cgc tac gtg gee ace cag tgg tgc gtg 750 
Val His Asp Ser Arg Tyr Gly Arg Tyr Val Ala Thr Gin Trp Cys Val 
185 190 195 

gcc gtc tac age ctg tgc ttc ctg gcc aca gtg gcc gtg gtg gcc ctg 798 
Ala Val Tyr Ser Leu Cys Phe Leu Ala Thr Val Ala Val Val Ala Leu 
200 205 210 

agt gtg atg ggc cac aca ggg ggc etg ggc tgc ccc ttt gac egg ctg 846 
Ser Val Met Gly His Thr Gly Gly Leu Gly Cys Pro Phe Asp Arg Leu 

215 220 225 230 

gtg gtg gtg tac acc ttc ctg get gtg etc ctg tac etc age gcc gee 894 
Val Val Val Tyr Thr Phe Leu Ala Val Leu Leu Tyr Leu Ser Ala Ala 
235 240 245 

gtg ate tgg cea gtc ttc tgt ttc gat ccc aag tac ggt gag ccc aaa 942 
Val lie Trp Pro Val Phe Cys Phe Asp Pro Lys Tyr Gly Glu Pro Lys 
250 255 260 

egg eee ccc aac tgt get egg ggc age tgt ccc tgg gac age cag etg 990 
Arg Pro Pro Asn Cys Ala Arg Gly Ser Cys Pro Trp Asp Ser Gin Leu 

265 270 275 

gtg gtg gee ate ttc acc tac gtc aac ctg etc ctg tac gtc gtt gac 1038 
Val Val Ala He Phe Thr Tyr Val Asn Leu Leu Leu Tyr Val Val Asp 
280 285 290 



etc gcc tac tec cag agg att cgc ttc gtg ccc 
Leu Ala Tyr Ser Gin Arg He Arg Phe Val Pro 
295 300 305 


age ctg tag 
Ser Leu 


. 1080 


eecacagtgg 


cagcecaccc 


acctctgetc 


tctggccaee 


agctccaggt 


tctgcgaggg 


1140 


aaactcaggt 


gaaceaaagt 


gtcaagtage 


gageaggagg 


gaagcegegg 


gcteagggga 


1200 


gatgaagtac 


aacgggagag 


gtcgctcccc 


gacttccaga 


ctgggctcag 


agggaeaagg 


1260 


geaggacagt 


agggeagagg 


cectgagcag 


aegcagggga 


gagcctcacc 


taggccaatc 


1320 


eccaactcac 


tetctetcac 


catttcceag 


cgctcctggg 


getcagtget 


ggtcaetgga 


1380 


acatcataag 


ggaaaaettt 


aaagcagetg 


etgttgtgac 


ecattttgea 


getggggacg 


1440 


tggaggctag 


aagcatttgc 


gaggcaccca 


gtaeetccat 


tagtgtctgg 


agaccacagt 


1500 


eaggetgggt 


aaggcaaget 


gaaagaggaa 


tgaettccag 


aggagaegaa 


cecaggagag 


1560 
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tagctgggcc 


aggcaagggg 


agggaggggc 


gtggggagcc ggaaccactc 


ctggtitcaga 


1620 


gaggcagctc 


ccagaaccct 


gggaagagag 


ccctgcctgg ggagctgtgc 


cagctcacag 


1680 


agaaggagcc 


agaaatctgg 


accagcctct 


cccacccacg tccgggaagc 


ctctgggctg 


1740 


gtagccacag 


tgtttggaag 


caggcttcct 


ccggagtctc cgcaggctgg 


aggtggcgcg 


1800 


ggtggcctgg 


aagagcctgc 


agagcccggc 


ggcaccgagt gcacagtgga 


cggggaggac 


1860 


ctggttccgc 


catagccacg 


gagccccacc 


tggacactca ccattggctg 


tgacgaagca 


1920 


gcttcagcag 


tgcccggcgg 


gcatctgtgc 


actgtgggca tctgtggcac 


tgggagggag 


1980 


cccggctgag 


ggcggccgct 


ggacacagaa 


tctgggtact gcttgcctct 


gctcaagggt 


2040 


ccagttgccg 


aaactcctga 


cgccggggcc 


atcatcctcc aggctccagc 


cagcttctcc 


2100 


tgcacagaag 


cccagcctgg 


ticcagccagg 


agctgaccca ctggccaccc 


ctgagtccaa 


2160 


gccgggtggg 


cagtggcaca 


acagcccctc 


agcccattga ctgggcccca 


ttgacgtcct 


2220 


tgagcaggaa 


ataaatgctg 


acatttatac 


gtaaaaaaaa aaaaaaaaaa 




2270 



<210> 12 

<211> 307 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Gly Ser Thr Met Glu Pro Pro Gly Gly Ala Tyr Leu His Leu Gly 
15 10 15 

Ala Val Thr Ser Pro Val Gly Thr Ala Arg Val Leu Gin Leu Ala Phe 
20 25 30 

Gly Cys Thr Thr Phe Ser Leu Val Ala His Arg Gly Gly Phe Ala Gly 
35 40 45 



Val Gin Gly Thr Phe Cys Met Ala Ala Trp Gly Phe Cys Phe Ala Val 
50 55 60 



Ser Ala Leu Val Val Ala Cys GXu Phe Thr Arg Leu His Gly Cys Leu 
65 70 75 80 



Arg Leu Ser Trp Gly Asn Phe Thr Ala Ala Phe Ala Met Leu Ala Thr 
85 90 95 



Leu Leu Cys Ala Thr Ala Ala Val Leu Tyr Pro Leu Tyr Phe Ala Arg 
100 105 110 



Arg Glu Cys Pro Pro Glu Fro Ala Gly Cys Ala Ala Arg Asp Phe Arg 
115 120 125 



Leu Ala Ala Ser Val Phe Ala Gly Leu Leu Phe Leu Ala Tyr Ala Val 
130 135 140 



Glu Val Ala Leu Thr Arg Ala Arg Pro Gly Gin Val Ser Ser Tyr Met 
145 150 155 160 



Ala Thr Val Ser Gly Leu Leu Lys He Val Gin Ala Phe Val Ala Cys 
165 170 175 

He He Phe Gly Ala Leu Val His Asp Ser Arg Tyr Gly Arg Tyr Val 
180 185 190 
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Ala Thr Gin Trp Cys Val Ala Val Tyr Ser Leu Cys Phe Leu Ala Thr 
195 200 205 

Val Ala Val Val Ala Leu Ser Val Met Gly His Thr Gly Gly Leu Gly 
210 215 220 

Cys Pro Phe Asp Arg Leu Val Val Val Tyr Thr Phe Leu Ala Val Leu 
225 230 235 240 

Leu Tyr Leu Ser Ala Ala Val lie Trp Pro Val Phe Cys Phe Asp Pro 
245 250 255 

Lys Tyr Gly Glu Pro Lys Arg Pro Pro Asn Cys Ala Arg Gly Ser Cys 
260 265 270 

Pro Trp Asp Ser Gin Leu val Val Ala lie Phe Thr Tyr Val Asn Leu 
275 280 285 

Leu Leu Tyr Val Val Asp Leu Ala Tyr Ser Gin Arg lie Arg Phe Val 
290 295 300 

Pro Ser Leu 
305 

<210> 13 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 13 

actgcagagg actgaggggc cttg 24 

<210> 14 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 14 

aagacactgg ttgccaggcg gaag 24 

<210> 15 

<211> 153 

<212> PRT 

<213> Mus musculus 

<400> 15 

Met Gly Ser Thr Met Glu Pro Pro Gly Gly Ala Tyr Leu His Leu Gly 
15 10 15 

Ala Val Thr Ser Pro Val Gly Thr Ala Arg Met Leu Gin Leu Ala Phe 
20 25 30 

Gly Cys Thr Thr Phe Ser Leu Val Ala His Arg Gly Gly Phe Gly Gly 
35 40 45 

Val Gin Gly Thr Phe Cys Met Ala Ala Trp Gly Phe Cys Phe Ala Phe 
50 55 60 

Ser Val Leu Val Val Ala Cys Glu Phe Thr Lys Leu His Ser Cys Leu 
65 70 75 80 



Arg Leu Ser Trp Gly Asn Phe Thr Ala Ala Phe Ala Met Leu Ala Thr 
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85 90 95 



Leu Leu Cys Ala Thr Ala Ala Val He Tyr Pro Leu Tyr Phe Thr Arg 
100 105 110 



Leu Glu Cys Pro Pro Glu Pro Ala Gly Cys Met Val Ala Pro Cys Gin 
115 120 125 



Arg Pro Ala Pro Glu Ser Pro Trp Lys Asp Asp Asp Val Met Thr Ala 
130 135 140 



Met Glu Tyr Leu Ser Arg His Pro Tbr 
145 150 



<210> 16 

<211> 6455 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (431).. (5554) 

<223> 

<400> 16 



gctttgtgca 


agaaagtgca 


agtttcccgt 


tctggcttca tttttgttcc 


cttttgcaat 


60 


cctcctggct 


cccccccaaa 


ccaagctagc 


aaagcaatgg ccccggttcc 


cccccaacgc 


120 


ctgacctgcg 


tttactggga 


ggagagcggg 


agagggagcg cgcattctgg 


agcaggctgc 


180 


tttgactccg 


accacaggct 


gttttgtgca 


ggctgtccct cttcttcaaa 


atcgtgcatc 


240 


ccctccccga 


agcagcaggc 


agtgtgcctc 


cattcagcca catttggtat 


gcatgagcac 


300 


ggctgcagag 


agaggggagg 


tggctgtttt 


aagaaggttc aggggctcag 


gcaaggctac 


360 


ttgactagtc 


ttccaagttc 


caggaagcct 


ctgccctaat ggaatttgca 


ggtgtggaga 


420 


tgaccatggg 


atg cca gag ccg tgg ggg acc gtt tat ttt eta 
Met Pro Glu Pro Trp Gly Thr Val Tyr Phe Leu 
1 5 10 


ggc att 
Gly He 


469 



get cag gtt ttc agt ttc ttg ttt tec tgg tgg aat ttg gaa ggg gtc 517 
Ala Gin Val Phe Ser Phe Leu Phe Ser Trp Trp Asn Leu Glu Gly Val 
15 20 25 

atg aat cag get gat get cct cga ccc eta aac tgg acc ate egg aag 565 
Met Asn Gin Ala Asp Ala Pro Arg Pro Leu Asn Trp Thr He Arg Lys 
30 35 40 45 

ctg tgc cac gca gcc ttt ctt cca tct gtc aga ctt ctg aag get cag 613 
Leu Cys His Ala Ala Phe Leu Pro Ser Val Arg Leu Leu Lys Ala Gin 
50 55 60 

aaa tee tgg ata gaa aga gea ttt tat aaa aga gaa tgt gtc eae ate 661 
Lys Ser Trp He Glu Arg Ala Phe Tyr Lys Arg Glu Cys Val His He 
65 70 75 

ata ccc age acc aaa gac ccc cat agg tgt tgc tgt ggg cgt ctg ata 709 
He Pro Ser Thr Lys Asp Pro His Arg Cys Cys Cys Gly Arg Leu He 
80 85 90 

ggc cag eat gtt ggc etc ace eee agt ate tec gtg ctt cag aat gag 757 
Gly Gin His Val Gly Leu Thr Pro Ser He Ser Val Leu Gin Asn Glu 
95 100 105 

aaa aat gaa agt cgc etc tec cga aat gae ate cag tct gaa aag tgg 805 
Lys Asn Glu Ser Arg Leu Ser Arg Asn Asp He Gin Ser Glu Lys Trp 
110 115 120 125 

tec ate age aaa eae act eaa etc age ect aeg gat get ttt ggg ace 853 
Ser He Ser Lys His Thr Gin Leu Ser Pro Thr Asp Ala Phe Gly Thr 
130 135 140 
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att gag ttc caa gga ggt ggc cat tec aac aaa gcc atg tat gtg cga 
lie Glu Phe Gin 61y Gly Gly His Ser Asn Lys Ala Met Tyr Val Arg 
145 150 155 



301 



gta tot ttt gat aca aaa cot gat etc etc tta cac ctg atg acc aag 949 
Val Ser Phe Asp Thr Lys Pro Asp Leu Leu Leu His Leu Met Thr Lys 
160 165 170 

gaa tgg cag ttg gag ctt ccc aag ctt etc ate tct gtc cat ggg ggc 997 
Glu Trp Gin Leu Glu Leu Pro Lys Leu Leu lie ser val His Gly Gly 
175 180 185 

ctg cag aac ttt gaa etc cag eca aaa etc aag caa gtc ttt ggg aaa 1045 
Leu Gin Asn Phe Glu Leu Gin Pro Lys Leu Lys Gin Val Phe Gly Lys 
190 195 200 205 

ggg etc ate aaa gca gca atg aca act gga gcg tgg ata ttc act gga 1093 
Gly Leu lie Lys Ala Ala Met Thr Thr Gly Ala Trp lie Phe Thr Gly 
210 215 220 

ggg gtt aac aca ggt gtt att cgt cat gtt ggc gat gcc ttg aag gat 1141 
Gly Val Asn Thr Gly Val lie Arg His Val Gly Asp Ala Leu Lys Asp 
225 230 235 

cat gcc tct aag tct cga gga aag ata tgc acc ata ggt att gcc ccc 1189 
His Ala Ser Lys Ser Arg Gly Lys lie Cys Thr lie Gly lie Ala Pro 
240 245 250 

tgg gga att gtg gaa aac cag gag gac etc att gga aga gat gtt gtc 1237 
Trp Gly lie Val Glu Asn Gin Glu Asp Leu lie Gly Arg Asp Val Val 
255 260 265 

egg cca tac cag acc atg tec aat ccc atg age aag etc act gtt etc 1285 
Arg Pro Tyr Gin Thr Met Ser Asn Pro Met Ser Lys Leu Thr Val Leu 
270 275 280 285 

aac age atg cat tec cac ttc att ctg get gac aac ggg acc act gga 1333 
Asn Ser Met His Ser His Phe lie Leu Ala Asp Asn Gly Thr Thr Gly 
290 295 300 

aaa tat gga gca gag gtg aaa ctt cga aga caa ctg gaa aag cat att 1381 
Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Gin Leu Glu Lys His lie 
305 310 315 

tea etc cag aag ata aac aca aga ate ggt caa ggt gtt cet gtg gtg 1429 
Ser Leu Gin Lys lie Asn Thr Arg lie Gly Gin Gly Val Pro Val Val 
320 325 330 

gca etc ata gtg gaa gga gga ccc aat gtg ate teg att gtt ttg gag 1477 
Ala Leu lie Val Glu Gly Gly Pro Asn Val lie Ser He Val Leu Glu 
335 340 345 

tac ctt cga gac acc cet ccc gtg eca gtg gtt gtc tgt gat ggg agt 1525 
Tyr Leu Arg Asp Thr Pro Pro Val Pro Val Val Val Cys Asp Gly Ser 
350 355 360 365 

gga egg gca teg gac ate ctg gcc ttt ggg cat aaa tac tea gaa gaa 1573 
Gly Arg Ala Ser Asp He Leu Ala Phe Gly His Lys Tyr Ser Glu Glu 
370 375 380 

ggc gga ctg ata aat gaa tct ttg agg gac cag ctg ttg gtg act ata 1621 
Gly Gly Leu He Asn Glu Ser Leu Arg Asp Gin Leu Leu Val Thr He 
385 390 395 

cag aag act ttc aca tac act cga acc caa get cag cat ctg ttc ate 1669 
Gin Lys Thr Phe Thr Tyr Thr Arg Thr Gin Ala Gin His Leu Phe He 
400 405 410 

ate etc atg gag tgc atg aag aag aag gaa ttg att aeg gta ttt egg 1717 
He Leu Met Glu Cys Met Lys Lys Lys Glu Leu He Thr Val Phe Arg 
415 420 425 

atg gga tea gaa gga cac cag gac att gat ttg get ate ctg aca get 1765 
Met Gly Ser Glu Gly His Gin Asp He Asp Leu Ala He Leu Thr Ala 
430 435 440 445 

tta etc aaa gga gcc aat gcc teg gcc cca gac caa ctg age tta get 1813 
Leu Leu Lys Gly Ala Asn Ala Ser Ala Pro Asp Gin Leu Ser Leu Ala 
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450 455 460 

tta gcc tgg aac aga gtc gac ate get cge age cag ate ttt att tac 1861 
Leu Ala Trp Asn Arg val Asp He Ala Arg Ser Gin He Phe He Tyr 
465 470 475 

ggg caa cag tgg ccg gtg gga tct ctg gag caa gcc atg ttg gat gcc 1909 
Gly Gin Gin Trp Pro Val Gly Ser Leu Glu Gin Ala Met Leu Asp Ala 
480 485 490 

tta gtt ctg gac aga gtg gat ttt gtg aaa tta etc ata gag aat gga 1957 
Leu Val Leu Asp Arg Val Asp Phe Val Lys Leu Leu He Glu Asn Gly 
495 500 505 

gta age atg cac cgt ttt etc ace ate tec aga eta gag gaa ttg tac 2005 
Val Ser Met His Arg Phe Leu Thr He Ser Arg Leu Glu Glu Leu Tyr 
510 515 520 525 

aat aeg aga eat ggg cec tea aat aca ttg tac cac ttg gtc agg gat 2053 
Asn Thr Arg His Gly Pro Ser Asn Thr Leu Tyr His Leu Val Arg Asp 
530 535 540 

gtc aaa aag ggg aac ctg ccc cca gac tac aga ate age ctg att gac 2101 
Val Lys Lys Gly Asn Leu Pro Pro Asp Tyr Arg He Ser Leu He Asp 
545 550 555 

ate ggc ctg gtg ate gag tac ctg atg ggc ggg get tat cgc tgc aac 2149 
He Gly Leu Val He Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn 
560 565 570 

tac aeg cgc aag cge ttc egg ace etc tac cac aac etc tte ggc cec 2197 
Tyr Thr Arg Lys Arg Phe Arg Thr Leu Tyr His Asn Leu Phe Gly Pro 
575 580 585 

aag agg ccc aaa gcc ttg aaa ctg ctg gga atg gag gat gat att ccc 2245 
Lys Arg Pro Lys Ala Leu Lys Leu Leu Gly Met Glu Asp Asp He Pro 
590 595 600 605 

ttg agg ega gga aga aag aca ace aag aaa cgt gaa gaa gag gtg gae 2293 
Leu Arg Arg Gly Arg Lys Thr Thr Lys Lys Arg Glu Glu Glu Val Asp 
610 615 620 

att gae ttg gat gat ect gag ate aac cac ttc ccc tte ect tte eat 2341 
He Asp Leu Asp Asp Pro Glu He Asn His Phe Pro Phe Pro Phe His 
625 630 635 

gag etc atg gtg tgg get gtt etc atg aag egg cag aag atg gee ctg 2369 
Glu Leu Met Val Trp Ala Val Leu Met Lys Arg Gin Lys Met Ala Leu 
640 645 650 

ttc ttc tgg cag cac ggt gag gag gcc atg gee aag gee ctg gtg gcc 2437 
Phe Phe Trp Gin His Gly Glu Glu Ala Met Ala Lys Ala Leu Val Ala 
655 660 665 

tgc aag etc tgc aaa gcc atg get cat gag gcc tct gag aac gac atg 2485 
Cys Lys Leu Cys Lys Ala Met Ala His Glu Ala Ser Glu Asn Asp Met 
670 675 680 685 

gtt gae gae att tee cag gag ctg aat cac aat tec aga gae ttt ggc 2533 
Val Asp Asp He Ser Gin Glu Leu Asn His Asn Ser Arg Asp Phe Gly 

690 695 700 

cag ctg get gtg gag etc ctg gac cag tee tac aag cag gac gaa cag 2581 
Gin Leu Ala Val Glu Leu Leu Asp Gin Ser Tyr Lys Gin Asp Glu Gin 
705 710 715 

ctg gcc atg aaa ctg ctg aeg tat gag ctg aag aac tgg age aac gee 2629 
Leu Ala Met Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ala 
720 725 730 

aeg tgc ctg cag ett gee gtg get gee aaa cac cgc gac ttc ate gcg 2677 
Thr Cys Leu Gin Leu Ala Val Ala Ala Lys His Arg Asp Phe He Ala 
735 740 745 

cac aeg tgc age cag atg ctg etc ace gae atg tgg atg ggc egg etc 2725 
His Thr Cys Ser Gin Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu 
750 755 760 765 

cgc atg cgc aag aac tea ggc etc aag gta att ctg gga att eta ett 2773 
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Arg Met Arg X»ys Asn Ser Gly Leu Lys Val lie Leu Gly lie Leu Leu 
770 775 780 

cct cct tea att etc age ttg gag ttc aag aac aaa gac gac atg ccc 2821 
Pro Pro Ser lie Leu Ser Leu Glu Phe Lys Asn Lys Asp Asp Met Pro 
785 790 795 

tat atg tot cag gee cag gaa ate cac etc caa gag aag gag gca gaa 2869 
Tyr Met Ser Gin Ala Gin Glu lie His Leu Gin Glu Lys Glu Ala Glu 
800 805 810 

gaa cca gag aag ccc aca aag gaa aaa gag gaa gag gac atg gag etc 2917 
Glu Pro Glu Lys Pro Thr Lys Glu Lys Glu Glu Glu Asp Met Glu Leu 
815 820 825 

aca gca atg ttg gga cga aac aac ggg gag tec tec agg aag aag gat 2965 
Thr Ala Met Leu Gly Arg Asn Asn Gly Glu Ser Ser Arg Lys Lys Asp 
830 835 840 845 



gaa gag gaa gtt cag age aag cac egg tta ate ccc etc ggc aga aaa 
Glu Glu Glu Val Gin Ser Lys His Arg Leu lie Pro Leu Gly Arg Lys 
850 855 860 



3013 



ate tat gaa tte tae aat gca ccc ate gtg aag ttc tgg ttc tac aca 
lie Tyr Glu Phe Tyr Asn Ala Pro lie Val Lys Phe Trp Phe Tyr Thr 
865 870 875 



3061 



ctg geg tat ate gga tae etg atg etc ttc aac tat ate gtg tta gtg 
Leu Ala Tyr lie Gly Tyr Leu Met Leu Phe Asn Tyr lie val Leu Val 
880 885 890 



3109 



aag atg gaa cge tgg ecg tec ace cag gaa tgg ate gta ate tec tat 
Lys Met Glu Arg Trp Pro Ser Thr Gin Glu Trp lie Val He Ser Tyr 
895 900 905 



3157 



att ttc acc ctg gga ata gaa aag atg aga gag att ctg atg tea gag 
He Phe Thr Leu Gly He Glu Lys Met Arg Glu He Leu Met Ser Glu 
910 915 920 925 



3205 



cca ggg aag ttg eta cag aaa gtg aag gta tgg ctg cag gag tac tgg 
Pro Gly Lys Leu Leu Gin Lys Val Lys Val Trp Leu Gin Glu Tyr Trp 
930 935 940 



3253 



aat gtc acg gac etc ate gee ate ctt ctg ttt tct gte gga atg ate 
Asn Val Thr Asp Leu He Ala He Leu Leu Phe Ser Val Gly Met He 
945 950 955 



3301 



ctt cgt etc caa gac cag ccc ttc agg agt gac ggg agg gtc ate tac 
Leu Arg Leu Gin Asp Gin Pro Phe Arg Ser Asp Gly Arg Val He Tyr 
960 965 970 



3349 



tge gtg aac ate att tac tgg tat ate cgt etc eta gac ate ttc ggc 
Cys Val Asn He He Tyr Trp Tyr He Arg Leu Leu Asp He Phe Gly 
975 980 985 



3397 



gtg aac aag tat ttg ggc ccg tat gta atg atg att gga aaa atg atg 
Val Asn Lys Tyr Leu Gly Pro Tyr Val Met Met He Gly Lys Met Met 
990 995 1000 1005 



3445 



ata gac atg atg tac ttt gtc ate att atg 
He Asp Met Met Tyr Phe val He He Met 
1010 1015 



ctg gtg gtt ctg atg 
Leu Val Val Leu Met 
1020 



3490 



age ttt ggg gtc gee agg caa gee ate ctt 
Ser Phe Gly Val Ala Arg Gin Ala He Leu 
1025 1030 



ttt ccc aat gag gag 
Phe Pro Asn Glu Glu 
1035 



3535 



cca tea tgg aaa etg gee aag aac ate tte tac atg ccc tat tgg 
Pro Ser Trp Lys Leu Ala Lys Asn He Phe Tyr Met Pro Tyr Trp 
1040 1045 1050 



3580 



atg att tat ggg gaa gtg ttt gcg gac cag ata gac cct ccc tgt 
Met He Tyr Gly Glu Val Phe Ala Asp Gin He Asp Pro Pro Cys 
1055 1060 1065 



3625 



gga cag aat gag acc cga gag gat ggt aaa 
Gly Gin Asn Glu Thr Arg Glu Asp Gly Lys 
1070 1075 



ata ate cag etg cct 
He He Gin Leu Pro 
1080 



3670 
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gee ate atg gee tgc 
Ala He Met Ala Cys 
1095 



3715 



tac etc tta gtg gca 
Tyr Leu Leu Val Ala 
1100 



aac ate ttg ctg gte 
Asn He Leu Leu Val 
1105 



aac etc etc att get 
Asn Leu Leu He Ala 
1110 



3760 



gte ttt aac aat aca 
Val Phe Asn Asn Thr 
1115 



ttt ttt gaa gta aaa teg ata tec aac caa 
Phe Phe Glu Val Lys Ser He Ser Asn Gin 
1120 1125 



3805 



gte tgg aag ttt cag agg tat eag etc ate atg act ttc cat gaa 
Val Trp Lys Phe Gin Arg Tyr Gin Leu He Met Thr Phe His Glu 
1130 1135 1140 



3850 



agg cca gtt ctg ccc 
Arg Pro Val Leu Pro 
1145 



cca cca ctg ate ate 
Pro Fro Leu He He 
1150 



ttc age cac atg ace 
Phe ser His Met Thr 
1155 



3895 



atg ata ttc eag cac ctg tgc tgc ega tgg 
Met He Phe Gin His Leu Cys Cys Arg Trp 
1160 1165 



agg aaa cac gag age 
Arg Lys His Glu Ser 
1170 



3940 



gac ccg gat gaa agg gae tac ggc ctg aaa etc ttc ata acc gat 
Asp Pro Asp Glu Arg Asp Tyr Gly Leu Lys Leu Phe He Thr Asp 
1175 1180 1185 



3985 



gat gag etc aag aaa gta cat gac ttt gaa 
Asp Glu Leu Lys Lys Val His Asp Phe Glu 
1190 1195 



gag caa tgc ata gaa 
Glu Gin Cys He Glu 
1200 



4030 



gaa tac ttc aga gaa aag gat gat egg ttc 
Glu Tyr Phe Arg Glu Lys Asp Asp Arg Phe 
1205 1210 



aac tea tet aat gat 
Asn Ser Ser Asn Asp 
1215 



4075 



gag agg ata egg gtg act tea gaa agg gtg gag aac atg tet atg 
Glu Arg He Arg Val Thr Ser Glu Arg Val Glu Asn Met Ser Met 
1220 1225 1230 



4120 



egg ctg gag gaa gte 
Arg Leu Glu Glu Val 
1235 



aac gag aga gag cac tec atg aag get tea 
Asn Glu Arg Glu His Ser Met Lys Ala Ser 
1240 1245 



4165 



etc cag acc gtg gae ate egg ctg gcg eag ctg gaa gac ett ate 
Leu Gin Thr Val Asp He Arg Leu Ala Gin Leu Glu Asp Leu He 
1250 1255 1260 



4210 



ggg cgc atg gee aeg gee ctg gag cgc ctg * aca ggt ctg gag egg 
Gly Arg Met Ala Thr Ala Leu Glu Arg Leu Thr Gly Leu Glu Arg 
1265 1270 1275 



4255 



gcc gag tec aac aaa ate cgc teg agg acc teg tea gae tgc aeg 
Ala Glu Ser Asn Lys He Arg Ser Arg Thr Ser Ser Asp Cys Thr 
1280 1285 1290 



4300 



gac gcc gcc tac att gte cgt cag age age ttc aac age cag gaa 
Asp Ala Ala Tyr He Val Arg Gin Ser Ser Phe Asn Ser Gin Glu 
1295 1300 1305 



4345 



ggg aac acc ttc aag 
Gly Asn Thr Phe Lys 
1310 



etc caa gag agt ata 
Leu Gin Glu Ser He 
1315 



gac cet gca ggt gag 
Asp Pro Ala Gly Glu 
1320 



4390 



gag acc atg tec cca 
Glu Thr Met Ser Pro 
1325 



act tet cca acc tta atg ccc cgt atg ega 
Thr Ser Pro Thr Leu Met Pro Arg Met Arg 
1330 1335 



4435 



age cat tet ttc tat teg gte aat atg aaa gac aaa ggt ggt ata 
Ser His Ser Phe Tyr Ser Val Asn Met Lys Asp Lys Gly Gly He 
1340 1345 1350 



4480 



gaa aag ttg gaa agt att ttt aaa gaa agg 
Glu Lys Leu Glu Ser He Phe Lys Glu Arg 
1355 1360 



tec ctg age eta cac 
Ser Leu Ser Leu His 
1365 



4525 



egg get act agt tec 
Arg Ala Thr Ser Ser 
1370 



cac tet gta gca aaa gaa ccc aaa get cet 
His Ser Val Ala Lys Glu Pro Lys Ala Pro 
1375 1380 



4570 
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gca gcc cct gcc aac acc ttg gcc att gtt cct gat tec aga aga 
Ala Ala Pro Ala Asn Thr Leu Ala lie Val ' Pro Asp Ser Arg Arg 
1385 1390 1395 

cca tea teg tgt ata gac ate tat gtc tct get atg gat gag etc 
Pro Ser Ser Cys He Asp He Tyr Val Ser Ala Met Asp Glu Leu 
1400 1405 1410 



cac tgt gat ata gac 
His Cys Asp He Asp 
1415 



cct ctg gac aat tec gtg aac ate ctt ggg 
Pro Leu Asp Asn Ser Val Asn He Leu Gly 
1420 1425 



4615 



4660 



4705 



ctg ggc gag cca age ttt tea act cca gta 
Leu Gly Glu Pro Ser Phe Ser Thr Pro Val 
1430 1435 



cct tec aea gcc cct 
Pro Ser Thr Ala Pro 
1440 



4750 



tea agt agt gee tat gca aea ctt gca cec aea gac aga cct cca 
Ser Ser Ser Ala Tyr Ala Thr Leu Ala Pro Thr Asp Arg Pro Pro 
1445 1450 1455 



4795 



age egg age att gat ttt gag gac ate acc tec atg gac act aga 4840 

Ser Arg Ser He Asp Phe Glu Asp He Thr Ser Met Asp Thr Arg 
1460 1465 1470 

tct ttt tct tea gac tae acc cac etc cca gaa tgc caa aac cec 4885 

Ser Phe Ser Ser Asp Tyr Thr His Leu Pro Glu Cys Gin Asn Pro 
1475 1480 1485 

tgg gac tea gag cct ecg atg tac cac ace att gag cgt tec aaa 4930 

Trp Asp Ser Glu Pro Pro Met Tyr His Thr He Glu Arg Ser Lys 
1490 1495 1500 



agt age cgc tac eta gcc acc aca cec ttt ctt eta gaa gag get 
Ser Ser Arg Tyr Leu Ala Thr Thr Pro Phe Leu Leu Glu Glu Ala 
1505 1510 1515 



4975 



ecc att gtg aaa tct eat age ttt atg ttt 
Pro He Val Lys Ser His Ser Phe Met Phe 
1520 1525 



tec cec tea agg age 
Ser Pro Ser Arg Ser 
1530 



5020 



tat tat gee aac ttt ggg gtg cct gta aaa aea gca gaa tae aea 
Tyr Tyr Ala Asn Phe Gly Val Pro Val Lys Thr Ala Glu Tyr Thr 
1535 1540 1545 



5065 



agt att aca gac tgt att gac aca agg tgt 
Ser He Thr Asp Cys He Asp Thr Arg Cys 
1550 1555 



gtc aat gee cct caa 
Val Asn Ala Pro Gin 
1560 



5110 



gca att gcg gac aga get gcc ttc cct gga ggt ctt gga gac aaa 
Ala He Ala Asp Arg Ala Ala Phe Pro Gly Gly Leu Gly Asp Lys 
1565 1570 1575 



5155 



gtg gag gac tta act tgc tgc cat cca gag cga gaa gca gaa ctg 
Val Glu Asp Leu Thr Cys Cys His Pro Glu Arg Glu Ala Glu Leu 
1580 1585 1590 



5200 



agt cac ecc age tct gac agt gag gag aat gag gcc aaa ggc cgc 
Ser His Pro Ser Ser Asp Ser Glu Glu Asn Glu Ala Lys Gly Arg 
1595 1600 1605 



5245 



aga gcc acc att gca ata tec tec cag gag ggt gat aac tea gag 
Arg Ala Thr He Ala He Ser Ser Gin Glu Gly Asp Asn Ser Glu 
1610 1615 1620 



5290 



aga ace ctg tec aac aac ate act gtt cec aag ata gag cgc gcc 
Arg Thr Leu Ser Asn Asn XI e Thr Val Pro Lys He Glu Arg Ala 
1625 1630 1635 



5335 



aac age tac teg gca gag gag cca agt gcg cca tat gca cac acc 
Asn Ser Tyr Ser Ala Glu Glu Pro Ser Ala Pro Tyr Ala His Thr 
1640 1645 1650 



5380 



agg aag age ttc tec 
Arg Lys Ser Phe Ser 
1655 



ate agt gac aaa etc gac agg cag egg aac 
He Ser Asp Lys Leu Asp Arg Gin Arg Asn 
1660 1665 



5425 



aca gca age ctg cga 
Thr Ala Ser Leu Arg 



aat cec ttc cag aga 
Asn Pro Phe Gin Arg 



age aag tec tec aag 
Ser Lys Ser Ser Lys 
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1670 1675 1680 

ccg gag ggc cga ggg gac age ctg tec atg agg aga ctg tee aga 5515 
Pro Glu Gly Arg Gly Asp Ser Leu Ser Met Arg Arg Leu Ser Arg 
1685 1690 1695 

aca teg get tte caa age ttt gaa age aag cac aac taa aecttcttaa 5564 
Thr Sec Ala Phe Gin Ser Phe Glu Ser Lys His Asn 





1700 




1705 








tatcegccac 


agaaggetea 


agaatceagc 


cctaaaattc 


tctccaactc 


cagtttttec 


5624 


cctttccttg 


aatcataeet 


gctttattct 


tagctgagea 


aaaeaageaa 


tgctttggga 


5684 


ggtgttaact 


caaaggtgac 


ttctgggcca 


cagatcaaga 


aagcatttga 


tctgacccag 


5744 


tgccagacac 


aggggattta 


aggcatgttc 


acacttgctg 


ggcagggagg 


gggaagagag 


5804 


ggagaaggaa 


gggttagaga 


tgaatgtgta 


tccgcagtca 


cagcagaaag 


ccatgagagc 


5864 


aggggaaaca 


aggggcttcg 


agcacgctcc 


atgccaggag 


gcatctgttg 


atttctgacc 


5924 


attatcaaga 


gttgtaggat 


gcagggctaa 


attgcaaaat 


aaaataaaat 


agccagcgta 


5984 


cacaatgaga 


tattetaaac 


ttccattctg 


ttttcttttc 


acattggctc 


catcactggt 


6044 


gactgatgaa 


gagcatcctc 


tttatteagt 


ataagceggc 


agcaagcagt 


tctaectaae 


6104 


gtcccacate 


etteteatgc 


caacacttct 


gtaattgatc 


attataaaga 


aaaaacaagg 


6164 


taaeagteat 


agttcacctg 


tetcttatct 


attcacttct 


ggtgccacaa 


ctgtttatcc 


6224 


ttttttgaag 


aaaataaggg 


aacagaaatg 


cctttttgta 


ttgcaatcga 


aatgaaagaa 


6284 


gagttgatgt 


taaaaaaaca 


aaagtcaagt 


gatttattat 


atacagtggg 


cgttcaagtc 


6344 


tagtcgagca 


agctcaggag 


aatgtaatta 


aataatttta 


tattttttaa 


tttattttgt 


6404 


atctcacctg 


tcatggatga 


attcattcac 


tgaatatgta 


atattgaact 


t 


6455 



<210> 17 

<211> 1707 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Pro Glu Pro Trp Gly Thr Val Tyr Phe Leu Gly He Ala Gin Val 
15 10 15 

Phe Ser Phe Leu Phe Ser Trp Trp Asn Leu Glu Gly Val Met Asn Gin 
20 25 30 

Ala Asp Ala Pro Arg Pro Leu Asn Trp Thr He Arg Lys Leu Cys His 
35 40 45 

Ala Ala Phe Leu Pro Ser Val Arg Leu Leu Lys Ala Gin Lys Ser Trp 
50 55 60 

He Glu Arg Ala Phe Tyr Lys Arg Glu Cys Val His He He Pro Ser 
65 70 75 80 

Thr Lys Asp Pro His Arg Cys Cys Cys Gly Arg Leu He Gly Gin His 
85 90 95 

Val Gly Leu Thr Pro Ser He Ser Val Leu Gin Asn Glu Lys Asn Glu 
100 105 110 

ser Arg Leu Ser Arg Asn Asp He Gin Ser Glu Lys Trp Ser He Ser 
115 120 125 
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Lys His Thr Gin Leu Ser Pro Thr Asp Ala Phe Gly Thr He Glu Phe 
130 135 140 

Gin Gly Gly Gly His Ser Asn Lys Ala Met Tyr Val Arg Val Ser Phe 
145 150 155 160 

Asp Thr Lys Pro- Asp Leu Leu Leu His Leu Met Thr Lys Glu Trp Gin 
165 170 175 



Leu Glu Leu Pro Lys Leu Leu He Ser Val His Gly Gly Leu Gin Asn 
180 185 190 

Phe Glu Leu Gin Pro Lys Leu Lys Gin Val Phe Gly Lys Gly Leu He 
195 200 205 

Lys Ala Ala Met Thr Thr Gly Ala Trp He Phe Thr Gly Gly Val Asn 
210 215 220 

Thr Gly Val He Arg His Val Gly Asp Ala Leu Lys Asp His Ala Ser 
225 230 235 240 



Lys Ser Arg Gly Lys He Cys Thr He Gly He Ala Pro Trp Gly He 
245 250 255 



Val Glu Asn Gin Glu Asp Leu He Gly Arg Asp Val Val Arg Pro Tyr 
260 265 270 



Gin Thr Met Ser Asn Pro Met Ser Lys Leu Thr Val Leu Asn Ser Met 
275 280 285 



His Ser His Phe He Leu Ala Asp Asn Gly Thr Thr Gly Lys Tyr Gly 
290 295 300 



Ala Glu Val Lys Leu Arg Arg Gin Leu Glu Lys His He Ser Leu Gin 
305 310 315 320 



Lys He Asn Thr Arg He Gly Gin Gly Val Pro Val Val Ala Leu He 
325 330 335 



Val Glu Gly Gly Pro Asn Val He Ser He Val Leu Glu Tyr Leu Arg 
340 345 350 



Asp Thr Pro Pro Val Pro Val Val Val Cys Asp Gly Ser Gly Arg Ala 
355 360 365 



Ser Asp He Leu Ala Phe Gly His Lys Tyr Ser Glu Glu Gly Gly Leu 
370' 375 380 



He Asn Glu Ser Leu Arg Asp Gin Leu Leu Val Thr He Gin Lys Thr 
385 390 395 400 



Phe Thr Tyr Thr Arg Thr Gin Ala Gin His Leu Phe He He Leu Met 
405 410 415 



Glu Cys Met Lys Lys Lys Glu Leu He Thr Val Phe Arg Met Gly Ser 
420 425 430 

Glu Gly His Gin Asp He Asp Leu Ala He Leu Thr Ala Leu Leu Lys 
435 440 445 
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Gly Ala Asn Ala Ser Ala Pro Asp Gin Leu Ser Leu Ala Leu Ala Trp 
450 455 460 



Asn Arg Val Asp He Ala Arg Ser Gin He Phe He Tyr Gly Gin Gin 
465 470 475 480 

Trp Pro Val Gly Ser Leu Glu Gin Ala Met Leu Asp Ala Leu Val Leu 
485 490 495 



Asp Arg Val Asp Phe val Lys Leu Leu He Glu Asn Gly Val Ser Met 
500 505 510 



His Arg Phe Leu Thr He Ser Arg Leu Glu Glu Leu Tyr Asn Thr Arg 
515 520 525 



His Gly Pro Ser Asn Thr Leu Tyr His Leu Val Arg Asp val Lys Lys 
530 535 540 



Gly Asn Leu Pro Pro Asp Tyr Arg He Ser Leu He Asp He Gly Leu 
545 550 555 560 



Val He Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr Arg 
565 570 575 



Lys Arg Phe Arg Thr Leu Tyr His Asn Leu Phe Gly Pro Lys Arg Pro 
580 585 590 



Lys Ala Leu Lys Leu Leu Gly Met Glu Asp Asp He Pro Leu Krg Arg 
595 600 605 

Gly Arg Lys Thr Thr Lys Lys Arg Glu Glu Glu Val Asp He Asp Leu 
610 615 620 



Asp Asp Pro Glu He Asn His Phe Pro Phe Pro Phe His Glu Leu Met 
625 630 635 640 



val Trp Ala Val Leu Met Lys Arg Gin Lys Met Ala Leu Phe Phe Trp 
645 650 655 



Gin His Gly Glu Glu Ala Met Ala Lys Ala Leu Val Ala Cys Lys Leu 
660 665 670 



Cys Lys Ala Met Ala His Glu Ala Ser Glu Asn Asp Met Val Asp Asp 
675 680 685 



He Ser Gin Glu Leu Asn His Asn Ser Arg Asp Phe Gly Gin Leu Ala 
690 695 700 



Val Glu Leu Leu Asp Gin Ser Tyr Lys Gin Asp Glu Gin Leu Ala Met 
705 710 715 720 



Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ala Thr Cys Leu 
725 730 735 



Gin Leu Ala Val Ala Ala Lys His Arg Asp Phe He Ala His Thr Cys 
740 745 750 



Ser Gin Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg Met Arg 
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Lys Asn Ser Gly Leu Lys Val lie Leu Gly He Leu Leu Pro Pro Ser 
770 775 780 



He Leu ser Leu Glu Phe Lys Asn Lys Asp Asp Met Pro Tyr Met Ser 
785 790 795 800 

Gin Ala Gin Glu He His Leu Gin Glu Lys Glu Ala Glu Glu Pro Glu 
805 810 815 

Lys Pro Thr Lys Glu Lys Glu Glu Glu Asp Met Glu Leu Thr Ala Met 
820 825 830 

Leu Gly Arg Asn Asn Gly Glu Ser Ser Arg Lys Lys Asp Glu Glu Glu 
835 840 845 

Val Gin Ser Lys His Arg Leu He Pro Leu Gly Arg Lys He Tyr Glu 
850 855 860 



Phe Tyr Asn Ala Pro He Val Lys Phe Trp Phe Tyr Thr Leu Ala Tyr 
865 870 875 880 



He Gly Tyr Leu Met Leu Phe Asn Tyr He Val Leu val Lys Met Glu 
885 890 895 



Arg Trp Pro Ser Thr Gin Glu Trp He Val He Ser Tyr He Phe Thr 
900 905 910 



Leu Gly He Glu Lys Met Arg Glu He Leu Met Ser Glu Pro Gly Lys 
915 920 925 



Leu Leu Gin Lys Val Lys Val Trp Leu Gin Glu Tyr Trp Asn Val Thr 
930 935 940 



Asp Leu He Ala He Leu Leu Phe Ser Val Gly Met He Leu Arg Leu 
945 950 955 960 



Gin Asp Gin Pro Phe Arg Ser Asp Gly Arg Val He Tyr Cys Val Asn 
965 970 975 

He He Tyr Trp Tyr He Arg Leu Leu Asp He Phe Gly Val Asn Lys 
980 985 990 

Tyr Leu Gly Pro Tyr Val Met Met He Gly Lys Met Met He Asp Met 
995 1000 1005 

Met Tyr Phe Val He He Met Leu Val Val Leu Met Ser Phe Gly 
1010 1015 1020 

Val Ala Arg Gin Ala He Leu Phe Pro Asn Glu Glu Pro Ser Trp 
1025 1030 1035 

Lys Leu Ala Lys Asn He Phe Tyr Met Pro Tyr Trp Met He Tyr 
1040 1045 1050 

Gly Glu Val Phe Ala Asp Gin He Asp Pro Pro Cys Gly Gin Asn 
1055 1060 1065 
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Glu Thr hrg GIu Asp Gly Lys lie He Gin Leu Pro Pro Cys Lys 
1070 1075 1080 



Thr Gly Ala Trp He Val Pro Ala He Met Ala Cys Tyr I«eu Leu 
1085 1090 1095 



Val Ala Asn He Leu Leu Val Asn Leu Leu He Ala Val Phe Asn 
1100 1105 1110 



Asn Thr Phe Phe Glu Val Lys Ser He Ser Asn Gin Val Trp Lys 
1115 1120 1125 



Phe Gin Arg Tyr Gin Leu He Met Thr Phe His Glu Arg Pro Val 
1130 1135 1140 



Leu Pro Pro Pro Leu He He Phe Ser His Met Thr Met He Phe 
1145 1150 1155 



Gin His Leu Cys Cys Arg Trp Arg Lys His Glu Ser Asp Pro Asp 
1160 1165 1170 



Glu Arg Asp Tyr Gly Leu Lys Leu Phe He Thr Asp Asp Glu Leu 
1175 1180 1185 



Lys Lys Val His Asp Phe Glu Glu Gin Cys He Glu Glu Tyr Phe 
1190 1195 1200 

Arg Glu Lys Asp Asp Arg Phe Asn Ser Ser Asn Asp Glu Arg He 
1205 1210 1215 



Arg Val Thr Ser Glu Arg Val Glu Asn Met Ser Met Arg Leu Glu 
1220 1225 1230 



Glu Val Asn Glu Arg Glu His Ser Met Lys Ala Ser Leu Gin Thr 
1235 1240 1245 



Val Asp He Arg Leu Ala Gin Leu Glu Asp Leu He Gly Arg Met 
1250 1255 1260 



Ala Thr Ala Leu Glu Arg Leu Thr Gly Leu Glu Arg Ala Glu Ser 
1265 1270 1275 



Asn Lys He Arg Ser Arg Thr Ser Ser Asp Cys Thr Asp Ala Ala 
1280 1285 1290 



Tyr He Val Arg Gin Ser Ser Phe Asn Ser Gin Glu Gly Asn Thr 
1295 1300 1305 



Phe Lys Leu Gin Glu Ser He Asp Pro Ala Gly Glu Glu Thr Met 
1310 1315 1320 



Ser Pro Thr Ser Pro Thr Leu Met Pro Arg Met Arg Ser His Ser 
1325 1330 1335 



Phe Tyr ser Val Asn Met Lys Asp Lys Gly Gly He Glu Lys Leu 
1340 1345 1350 



Glu Ser He Phe Lys Glu Arg Ser Leu Ser Leu His Arg Ala Thr 
1355 1360 1365 
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Ser Ser His Ser Val Ala Lys Glu Pro Lys Ala Pro Ala Ala Pro 
1370 1375 1380 

Ala Asn Thr Leu Ala He val Pro Asp Ser Arg Arg Pro Ser Ser 
1385 1390 1395 

Cys He Asp He Tyr val Ser Ala Met Asp Glu Leu His Cys Asp 
1400 1405 1410 

He Asp Pro Leu Asp Asn Ser Val Asn He Leu Gly Leu Gly Glu 
1415 1420 1425 



Pro Ser Phe Ser Thr Pro Val Pro Ser Thr Ala Pro Ser Ser Ser 
1430 1435 1440 



Ala Tyr Ala Thr Leu Ala Pro Thr Asp Arg Pro Pro Ser Arg Ser 
1445 1450 1455 

He Asp Phe Glu Asp He Thr Ser Met Asp Thr Arg Ser Phe Ser 
1460 1465 1470 



Ser Asp Tyr Thr His Leu Pro Glu Cys Gin Asn Pro Trp Asp Ser 
1475 1480 1485 



Glu Pro Pro Met Tyr His Thr He Glu Arg Ser Lys Ser Ser Arg 
1490 1495 1500 



Tyr Leu Ala Thr Thr Pro Phe Leu Leu Glu Glu Ala Pro He Val 
1505 1510 1515 



Lys Ser His Ser Phe Met Phe Ser Pro Ser Arg Ser Tyr Tyr Ala 
1520 1525 1530 

Asn Phe Gly Val Pro Val Lys Thr Ala Glu Tyr Thr Ser He Thr 
1535 1540 1545 



Asp Cys He Asp Thr Arg Cys Val Asn Ala Pro Gin Ala He Ala 
1550 1555 1560 



Asp Arg Ala Ala Phe Pro Gly Gly Leu Gly Asp Lys Val Glu Asp 
1565 1570 1575 



Leu Thr Cys Cys His Pro Glu Arg Glu Ala Glu Leu Ser His Pro 
1580 1585 1590 



Ser Ser Asp Ser Glu Glu Asn Glu Ala Lys Gly Arg Arg Ala Thr 
1595 1600 1605 • 



He Ala He Ser Ser Gin Glu Gly Asp Asn Ser Glu Arg Thr Leu 
1610 1615 1620 



Ser Asn Asn He Thr Val Pro Lys He Glu Arg Ala Asn Ser Tyr 
1625 1630 1635 

Ser Ala Glu Glu Pro Ser Ala Pro Tyr Ala His Thr Arg Lys Ser 
1640 1645 1650 



Phe Ser He Ser Asp Lys Leu Asp Arg Gin Arg Asn Thr Ala Ser 
1655 1660 1665 
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Leu Arg Asn Pro Phe Gin Arg Ser Lys Ser Ser Lys Pro Glu Gly 
1670 1675 1680 

Arg Gly Asp Ser Leu Ser Met Arg Arg Leu Ser Arg Thr Ser Ala 
1685 1690 1695 

Phe Gin Ser Phe Glu Ser Lys His Asn 
1700 1705 



<210> 18 

<211> 988 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ala Thr Cys 
15 10 IS 



Leu Gin Leu Ala val Ala Ala Lys His Arg Asp Phe lie Ala His Thr 
20 25 30 



Cys Ser Gin Met Leu Leu Thr 2Vsp Met Trp Met Gly Arg Leu Arg Met 
35 40 45 



Arg Lys Asn Ser Gly Leu Lys Val lie Leu Gly lie Leu Leu Pro Pro 
50 55 60 



Ser He Leu Ser Leu Glu Phe Lys Asn Lys Asp Asp Met Pro Tyr Met 
65 70 75 80 



Ser Gin Ala Gin Glu He His Leu Gin Glu Lys Glu Ala Glu Glu Pro 
85 90 95 



Glu Lys Pro Thr Lys Glu Lys Glu Glu Glu Asp Met Glu Leu Thr Ala 
100 105 110 



Met Leu Gly Arg Asn Asn Gly Glu Ser Ser Arg Lys Lys Asp Glu Glu 
115 120 125 



Glu Val Gin Ser Lys His Arg Leu He Pro Leu Gly Arg Lys lie Tyr 
130 135 140 



Glu Phe Tyr Asn Ala Pro He Val Lys Phe Trp Phe Tyr Thr Leu Ala 
145 150 155 160 



Tyr He Gly Tyr Leu Met Leu Phe Asn Tyr He Val Leu val Lys Met 
165 170 175 



Glu Arg Trp Pro Ser Thr Gin Glu Trp He Val He Ser Tyr He Phe 
180 185 190 



Thr Leu Gly He Glu Lys Met Arg Glu He Leu Met Ser Glu Fro Gly 
195 200 205 



Lys Leu Leu Gin Lys Val Lys Val Trp Leu Gin Glu Tyr Trp Asn Val 

210 215 220 

Thr Asp Leu He Ala He Leu Leu Phe Ser Val Gly Met He Leu Arg 

225 230 235 240 
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Leu Gin Asp Gin Pro Phe Arg Ser Asp Gly Arg Val lie Tyr Cys Val 
245 250 255 



Asn He lie Tyr Trp Tyr lie Arg Leu Leu Asp He Phe Gly val Asn 
260 265 270 

Lys Tyr Leu Gly Pro Tyr val Met Met He Gly Lys Met Met He Asp 
275 280 285 



Met Met Tyr Phe Val He He Met Leu Val Val Leu Met Ser Phe Gly 
290 295 300 



Val Ala Arg Gin Ala He Leu Phe Pro Asn Glu Glu Pro Ser Trp Lys 

305 310 315 320 

Leu Ala Lys Asn He Phe Tyr Met Pro Tyr Trp Met He Tyr Gly Glu 

325 330 335 



Val Phe Ala Asp Gin He Asp Pro Pro Cys Gly Gin Asn Glu Thr Arg 
340 345 350 



Glu Asp Gly Lys He He Gin Leu Pro Pro Cys Lys Thr Gly Ala Trp 
355 360 365 



He Val Pro Ala He Met Ala Cys Tyr Leu Leu Val Ala Asn He Leu 
370 375 380 



Leu Val Asn Leu Leu He Ala Val Phe Asn Asn Thr Phe Phe Glu Val 
385 390 395 400 



Lys Ser He Ser Asn Gin Val Trp Lys Phe Gin Arg Tyr Gin Leu He 
405 410 415 



Met Thr Phe His Glu Arg Pro val Leu Pro Pro Pro Leu He He Phe 
420 425 430 



Ser His Met Thr Met He Phe Gin His Leu Cys Cys Arg Trp Arg Lys 
435 440 445 



His Glu Ser Asp Pro Asp Glu Arg Asp Tyr Gly Leu Lys Leu Phe He 
450 455 460 



Thr Asp Asp Glu Leu Lys Lys Val His Asp Phe Glu Glu Gin Cys He 
465 470 475 480 



Glu Glu Tyr Phe Arg Glu Lys Asp Asp Arg Phe Asn Ser Ser Asn Asp 
485 490 495 



Glu Arg He Arg Val Thr Ser Glu Arg Val Glu Asn Met Ser Met Arg 

500 505 510 

Leu Glu Glu Val Asn Glu Arg Glu His Ser Met Lys Ala Ser Leu Gin 
515 520 525 



Thr Val Asp He Arg Leu Ala Gin Leu Glu Asp Leu He Gly Arg Met 
530 535 540 



Ala Thr Ala Leu Glu TVrg Leu Thr Gly Leu Glu Arg Ala Glu Ser Asn 
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545 550 555 560 



Lys lie Arg Ser Arg Thr Ser Ser Asp Cys Thr Asp Ala Ala Tyr lie 
565 570 575 



Val Arg Gin Ser Ser Phe Asn Ser Gin Glu Gly Asn Thr Phe Lys Leu 
580 585 590 



Gin Glu Ser lie Asp Pro Ala Gly Glu Glu Thr Met Ser Pro Thr Ser 
595 600 605 



Pro Thr Leu Met Pro Arg Met Arg Ser His Ser Phe Tyr Ser Val Asn 
610 615 620 



Met Lys Asp Lys Gly Gly lie Glu Lys Leu Glu Ser lie Phe Lys Glu 
625 630 635 640 



Arg Ser Leu Ser Leu His Arg Ala Thr Ser Ser His Ser Val Ala Lys 
645 650 655 

Glu Pro Lys Ala Pro Ala Ala Pro Ala Asn Thr Leu Ala lie Val Pro 
660 665 670 



Asp Ser Arg Arg Pro Ser Ser Cys He Asp He Tyr Val Ser Ala Met 
675 680 685 



Asp Glu Leu His Cys Asp He Asp Pro Leu Asp Asn Ser Val Asn He 
690 695 700 



Leu Gly Leu Gly Glu Pro Ser Phe Ser Thr Pro Val Pro Ser Thr Ala 
705 710 715 720 



Pro Ser Ser Ser Ala Tyr Ala Thr Leu Ala Pro Thr Asp Arg Pro Pro 
725 730 735 



Ser Arg Ser He Asp Phe Glu Asp He Thr Ser Met Asp Thr Arg Ser 
740 745 750 



Phe Ser Ser Asp Tyr Thr His Leu Pro Glu Cys Gin Asn Pro Trp Asp 
755 760 765 



Ser Glu Pro Pro Met Tyr His Thr He Glu Arg Ser Lys Ser Ser Arg 
770 775 780 



Tyr Leu Ala Thr Thr Pro Phe Leu Leu Glu Glu Ala Pro He Val Lys 
785 790 795 800 



Ser His Ser Phe Met Phe Ser Pro Ser Arg Ser Tyr Tyr Ala Asn Phe 
805 810 815 



Gly Val Pro Val Lys Thr Ala Glu Tyr Thr Ser He Thr Asp Cys He 
820 825 830 

Asp Thr Arg Cys Val Asn Ala Pro Gin Ala He Ala Asp Arg Ala Ala 
835 840 845 

Phe Pro Gly Gly Leu Gly Asp Lys Val Glu Asp Leu Thr Cys Cys His 
850 855 860 
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Pro Glu Arg Glu Ala Glu Leu Ser His Pro Ser Ser Asp Ser Glu Glu 
865 870 875 880 



Asn Glu Ala Lys Gly Arg Arg Ala Thr He Ala He Ser Ser Gin Glu 
885 890 895 



Gly Asp Asn Ser Glu Arg Thr Leu Ser Asn Asn He Thr Val Pro Lys 
900 905 910 



He Glu Arg Ala Asn Ser Tyr Ser Ala Glu Glu Pro Ser Ala Pro Tyr 
915 920 925 



Ala His Thr Arg Lys Ser Phe Ser He Ser Asp Lys Leu Asp Arg Gin 
930 935 940 



Arg Asn Thr Ala Ser Leu Arg Asn Pro Phe Gin Arg Ser Lys Ser Ser 
945 950 955 960 



Lys Pro Glu Gly Arg Gly Asp Ser Leu Ser Met Arg Arg Leu Ser Arg 
965 970 975 



Thr Ser Ala Phe Gin Ser Phe Glu Ser Lys His Asn 
980 985 



<210> 19 

<211> 1017 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Gin Glu Leu Asn His Asn Ser Arg Asp Phe Gly Gin Leu Ala Val Glu 
15 10 15 



Leu Leu Asp Gin Ser Tyr Lys Gin Asp Glu Gin Leu Ala Met Lys Leu 
20 25 30 



Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ala Thr Cys Leu Gin Leu 
35 40 45 



Ala Val Ala Ala Lys His Arg Asp Phe He Ala His Thr Cys Ser Gin 
50 55 60 



Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg Met Arg Lys Asn 
65 70 75 80 



Ser Gly Leu Lys Val He Leu Gly He Leu Leu Pro Pro Ser He Leu 
85 90 95 



Ser Leu Glu Phe Lys Asn Lys Asp Asp Met Pro Tyr Met Ser Gin Ala 
100 105 110 

Gin Glu He His Leu Gin Glu Lys Glu Ala Glu Glu Pro Glu Lys Pro 
115 120 125 

Thr Lys Glu Lys Glu Glu Glu Asp Met Glu Leu Thr Ala Met Leu Gly 
130 135 140 



Arg Asn Asn Gly Glu Ser Ser Arg Lys Lys Asp Glu Glu Glu Val Gin 
145 150 155 160 
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Ser Itys His Arg Leu lie Pro Leu Gly Arg Lys lie Tyr Glu Phe Tyr 
165 170 175 



Asn Ala Pro lie Val Lys Phe Trp Phe Tyr Thr Leu Ala Tyr lie Gly 
180 185 190 



Tyr Leu Met Leu Phe Asn Tyr lie Val Leu Val Lys Met Glu Arg Trp 
195 200 205 



Pro Ser Thr Gin Glu Trp He Val He Ser Tyr He Phe Thr Leu Gly 
210 215 220 



He Glu Lys Met Arg Glu He Leu Met Ser Glu Pro Gly Lys Leu Leu 
225 230 235 240 



Gin Lys Val Lys Val Trp Leu Gin Glu Tyr Trp Asn Val Thr Asp Leu 
245 250 255 



He Ala He Leu Leu Phe Ser Val Gly Met He Leu Arg Leu Gin Asp 
260 265 270 



Gin Pro Phe Arg Ser Asp Gly Arg Val He Tyr Cys Val Asn He He 
275 280 285 



Tyr Trp Tyr He Arg Leu Leu Asp He Phe Gly Val Asn Lys Tyr Leu 
290 295 300 



Gly Pro Tyr Val Met Met He Gly Lys Met Met He Asp Met Met Tyr 
305 310 315 320 



Phe Val He He Met Leu Val Val Leu Met Ser Phe Gly Val Ala Arg 
325 330 335 



Gin Ala He Leu Phe Pro Asn Glu Glu Pro Ser Trp Lys Leu Ala Lys 
340 345 350 



Asn He Phe Tyr Met Pro Tyr Trp Met He Tyr Gly Glu Val Phe Ala 
355 360 365 



Asp Gin He Asp Pro Pro Cys Gly Gin Asn Glu Thr Arg Glu Asp Gly 
370 375 380 



Lys He He Gin Leu Pro Pro Cys Lys Thr Gly Ala Trp He Val Pro 
385 390 395 400 



Ala He Met Ala Cys Tyr Leu Leu Val Ala Asn He Leu Leu Val Asn 
405 410 415 



Leu Leu He Ala Val Phe Asn Asn Thr Phe Phe Glu Val Lys Ser He 
420 425 430 



Ser Asn Gin Val Trp Lys Phe Gin Arg Tyr Gin Leu He Met Thr Phe 
435 440 445 



His Glu Arg Pro Val Leu Pro Pro Pro Leu He He Phe Ser His Met 
450 455 460 

Thr Met He Phe Gin His Leu Cys Cys Arg Trp Arg Lys His Glu Ser 
465 470 475 480 
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Asp Pro Asp Glu Arg Asp Tyr Gly Leu Lys Leu Phe lie Thr Asp Asp 
485 490 495 



Glu Leu Lys Lys Val His Asp Phe Glu Glu Gin Cys lie Glu Glu Tyr 
500 505 510 



Phe Arg Glu Lys Asp Asp Arg Phe Asn Ser Ser Asn Asp Glu Arg lie 
515 520 525 



Arg Val Thr Ser Glu Arg Val Glu Asn Met Ser Met Arg Leu Glu Glu 
530 535 540 



Val Asn Glu Arg Glu His Ser Met Lys Ala Ser Leu Gin Thr Val Asp 
545 550 555 560 



lie Arg Leu Ala Gin Leu Glu Asp Leu lie Gly Arg Met Ala Thr Ala 
565 570 575 



Leu Glu Arg Leu Thr Gly Leu Glu Arg Ala Glu Ser Asn Lys lie Arg 
580 585 590 



Ser Arg Thr Ser Ser Asp Cys Thr Asp Ala Ala Tyr lie Val Arg Gin 
595 600 605 



Ser Ser Phe Asn Ser Gin Glu Gly Asn Thr Phe Lys Leu Gin Glu Ser 
610 615 620 



lie Asp Fro Ala Gly Glu Glu Thr Met Ser Pro Thr Ser Pro Thr Leu 
625 630 635 640 



Met Pro Arg Met Arg Ser His Ser Phe Tyr Ser Val Asn Met Lys Asp 
645 650 655 



Lys Gly Gly lie Glu Lys Leu Glu Ser He Phe Lys Glu Arg Ser Leu 
660 665 670 



Ser Leu His Arg Ala Thr Ser Ser His Ser Val Ala Lys Glu Pro Lys 
675 680 685 



Ala Pro Ala Ala Pro Ala Asn Thr Leu Ala He Val Pro Asp Ser Arg 
690 695 700 



Arg Pro Ser Ser Cys lie Asp He Tyr Val Ser Ala Met Asp Glu Leu 

705 710 715 720 

His Cys Asp He Asp Pro Leu Asp Asn Ser Val Asn He Leu Gly Leu 
725 730 735 



Gly Glu Pro Ser Phe Ser Thr Pro Val Pro Ser Thr Ala Pro Ser Ser 
740 7.45 750 



Ser Ala Tyr Ala Thr Leu Ala Pro Thr Asp Arg Pro Pro Ser Arg Ser 
755 760 765 



He Asp Phe Glu Asp He Thr Ser Met Asp Thr Arg Ser Phe Ser Ser 
770 775 780 

Asp Tyr Thr His Leu Pro Glu Cys Gin Asn Pro Trp Asp Ser Glu Pro 
785 790 795 800 
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Pro Met Tyr His Thr lie Glu Arg Ser Lys Ser Ser Arg Tyr Leu Ala 
805 810 815 

Thr Thr Pro Phe Leu Leu Glu Glu Ala Pro lie Val Lys Ser His Ser 
820 825 830 

Phe Met Phe Ser Pro Ser Arg Ser Tyr Tyr Ala Asn Phe Gly Val Pro 
835 840 845 

Val Lys Thr Ala Glu Tyr Thr Ser He Thr Asp Cys He Asp Thr Arg 
850 855 860 

Cys Val Asn Ala Pro Gin Ala He Ala Asp Arg Ala Ala Phe Pro Gly 
865 870 875 880 

Gly Leu Gly Asp Lys Val Glu Asp Leu Thr Cys Cys His Pro Glu Arg 
885 890 895 

Glu Ala Glu Leu Ser His Pro Ser ser Asp Ser Glu Glu Asn Glu Ala 
900 905 910 

Lys Gly Arg Arg Ala Thr He Ala He Ser Ser Gin Glu Gly Asp Asn 
915 920 925 

Ser Glu Arg Thr Leu Ser Asn Asn He Thr Val Pro Lys He Glu Arg 
930 935 940 

Ala Asn Ser Tyr Ser Ala Glu Glu Pro Ser Ala Pro Tyr Ala His Thr 
945 950 955 960 

Arg Lys Ser Phe Ser He Ser Asp Lys Leu Asp Arg Gin Arg Asn Thr 
965 970 975 

Ala Ser Leu Arg Asn Pro Phe Gin Arg Ser Lys Ser Ser Lys Pro Glu 
980 985 990 

Gly Arg Gly Asp Ser Leu Ser Met Arg Lys Leu Ser Arg Thr Ser Ala 
995 1000 1005 



Phe Gin Ser Phe Glu Ser Lys His Thr 
1010 1015 



<210> 20 

<211> 736 

<212> PRT 

<213> Mus musculus 

<400> 20 

Met Val Leu Gly Thr Gly Thr Phe Leu Ser Ser Gin His Thr Ala Gly 
15 10 15 



Arg Leu Pro Pro Gly Ala Phe Ala Lys Gin Arg Leu Leu Cys Gly Ala 
20 25 30 



Ala Leu Leu Leu Tyr Val Ser Ala Asn Asn Pro He Gin Ala Gin Ser 
35 40 45 

Val Pro He Met Leu Ser Gin Arg Gly Leu Leu Ala Thr Cys Thr His 
, 50 55 60 
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Ser Gly Val Phe Leu Leu Pro Tyr Arg Leu Pro Pro Tyr Thr Gin Leu 
65 70 75 80 

Ala Pro Cys Gly Gin Asn Glu Thr Arg Glu Asp Gly Lys Thx lie Gin 
85 90 95 

Leu Pro Pro Cys Lys Thr Gly Ala Trp lie Val Pro Ala lie Met Ala 
100 105 110 

Cys Tyr Leu Leu Val Ala Asn lie Leu Leu Val Asn Leu Leu lie Ala 
115 120 125 



Val Phe Asn Asn Thr Phe Phe Glu Val Lys Ser He Ser Asn Gin Val 
130 135 140 

Trp Lys Phe Gin Arg Tyr Gin Leu He Met Thr Phe His Glu Arg Pro 
145 150 155 160 



Val Leu Pro Pro Pro Leu lie He Phe Ser His Met Thr Met He Phe 
165 170 175 



Gin His Val Cys Cys Arg Trp Arg Lys His Glu Ser Asp Gin Asp Glu 
180 185 190 

Arg Asp Tyr Gly Leu Lys Phe Leu He Thr Gly Asp Glu Leu Arg Lys 
195 200 205 

Val His Asp Phe Glu Glu Gin Cys He Glu Glu Tyr Phe Arg Glu Lys 
210 215 220 

Asp Asp Arg Phe Asn Ser Ser Asn Asp Glu Arg He Arg Val Thr Ser 
225 230 235 240 

Glu Arg Val Glu Asn Met Ser Met Arg Leu Glu Glu Val Asn Glu Arg 
245 250 255 

Glu His Ser Met Lys Ala Ser Leu Gin Thr Val Asp He Arg Leu Ala 
260 265 270 



Gin Leu Glu Asp Leu He Gly Arg Met Ala Thr Ala Leu Glu Arg Leu 
275 280 285 



Thr Gly Leu Glu Arg Ala Glu Ser Asn Lys He Arg Ser Arg Thr Ser 
290 295 300 



Ser Asp Cys Thr Asp Ala Ala Tyr He Val Arg Gin Ser Ser Phe Asn 
305 310 315 320 



Ser Gin Glu Gly Asn Thr Phe Lys Leu Gin Glu Ser He Asp Pro Ala 
325 330 335 



Gly Glu Glu Thr He Ser Pro Thr Ser Pro Thr Leu Met Pro Arg Met 
340 345 350 



Arg Ser His Ser Phe Tyr Ser Val Asn Val Lys Asp Lys Gly Gly He 
355 360 365 



Glu Lys Leu Glu Ser He Phe Lys Glu Arg Ser Leu Ser Leu His Arg 
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370 375 380 



Ala Thr Ser Ser His Ser Val Ala Lys Glu Pro Lys Ala Pro Ala Ala 
385 390 395 400 



Pro Ala Asn Thr Leu Ala He Val Pro Asp Ser Arg Arg Pro Ser Ser 
405 410 415 



Cys He Asp He Tyr Val Ser Ala Met Asp Glu Leu His Cys Asp He 
420 425 430 



Glu Pro Leu Asp Asn Ser Met Asn He Leu Gly Leu Gly Glu Pro Ser 
435 440 445 



Phe Ser Ala Leu Ala Pro Ser Thr Thr Pro Ser Ser Ser Ala Tyr Ala 
450 455 460 



Thr Leu Ala Pro Thr Asp Arg Pro Pro Ser Arg Ser He Asp Phe Glu 
465 470 475 480 



Asp Leu Thr Ser Met Asp Thr Arg Ser Phe Ser Ser Asp Tyr Thr His 
485 490 495 



Leu Pro Glu Cys Gin Asn Pro Trp Asp Thr Asp Pro Pro Thr Tyr His 
500 505 510 



Thr He Glu Arg Ser Lys Ser Ser Arg Tyr Leu Ala Thr Thr Pro Phe 
515 520 525 



Leu Leu Glu Glu Ala Pro He Val Lys Ser His Ser Phe Met Phe Ser 
530 535 540 



Pro Ser Arg Ser Tyr Tyr Ala Asn Phe Gly Val Pro Val Lys Thr Ala 
545 550 555 560 



Glu Tyr Thr Ser He Thr Asp Cys He Asp Thr Arg Cys Val Asn Ala 
565 570 575 



Pro Gin Ala He Ala Asp Arg Ala Thr Phe Pro Gly Gly Leu Gly Asp 
580 585 590 



Lys Val Glu Asp Leu Ser Cys Cys His Pro Glu Arg Glu Ala Glu Leu 
595 600 605 

Ser His Pro Ser Ser Asp Ser Glu Glu Asn Glu Ala Arg Gly Gin Arg 
610 615 620 

Ala Ala Asn Pro He Ser Ser Gin Glu Ala Glu Asn Ala Asp Arg Thr 
625 630 635 640 



Leu Ser Asn Asn He Thr Val Pro Lys He Glu Arg Ala Asn Ser Tyr 
645 650 655 

Ser Ala Glu Glu Pro Asn val Pro Tyr Ala His Thr Arg Lys Ser Phe 
660 665 670 

Ser He Ser Asp Lys Leu Asp Arg Gin Arg Asn Thr Ala Ser Leu Arg 
675 680 685 
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Asn Pro Phe 
690 



Gin Arg 



16U 100 PCT.ST25 
Lys Thr lie Leu Gin Tyr Thr Pro Asn Lys Leu 
695 700 



Tyr Pro Glu 
705 



Cys Leu 



Leu Ser Ser Ser Thr Gly Ala Val Glu Leu Tyr 
710 715 720 



Asp Pro Ala 



Glu Ala 
725 



He Leu Leu Ala Ala Phe Leu Asp Gly Gly Tyr 
730 735 



<210> 21 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 21 

gctttagcct ggaacagagt cgac 24 



<210> 22 

<211> 24 

<212> DNA 

<213> Homo sapiens 



<210> 23 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gaccaaggaa tggcagttgg age 23 



<210> 24 

<211> 24 

<212> DNA 

<213> Homo sapiens 



<210> 25 

<211> 3456 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (342).. (1535) 

<223> 

<400> 25 

ggacggtcca gaggtgtcga aatgtcctgg ggacctgagc agcagccacc agggaagagg 60 

cagggaggga gctgaggacc aggcttggtt gtgagaatcc ctgagcccag gcggtagatg 120 

ccaggaggtg tctggactgg ctgggccatg cctgggctga cctgtccagc cagggagagg 180 

gtgtgagggc agatetgggg gtgcccagat ggaaggaggc aggcatgggg gacaeccaag 240 

gecccctgge agcaccatga actaagcagg acacctggag gggaagaact: gtggggacet 300 

ggaggcctcc aacgactcct tcctgcttce tggacaggac t atg get gtg cag gga 356 



<400> 22 

gtctttcttc ctcgcetcaa ggga 



24 



<400> 24 

gtggtcccgt tgtcagccag aatg 



24 



Met Ala Val Gin Gly 
1 5 



tec cag aga aga ctt ctg ggc tec etc aac tee acc ecc aca gee ate 
Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser Thr Pro Thr Ala He 
10 15 20 



404 



ccc cag ctg ggg ctg get gcc aac cag aca gga gee egg tgc ctg gag 
Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly Ala Arg Cys Leu Glu 



452 
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25 30 35 

gtg tec ate tet gae ggg cte ttc etc age ctg ggg ctg gtg age ttg 500 

Val Ser lie Ser Asp GXy Leu Phe Leu Ser Leu Gly Leu Val Ser Leu 
40 45 50 

gtg gag aac gcg ctg gtg gtg gcc ace ate gcc aag aac egg aae ctg 548 

Val Glu Asn Ala Leu Val Val Ala Thr lie Ala Lys Asn Arg Asn Leu 
55 60 65 

cac tea ccc atg tac tgc ttc ate tgc tgc ctg gcc ttg teg gac ctg 596 

His Ser Pro Met Tyr Cys Phe lie Cys Cys Leu Ala Leu Ser Asp Leu 
70 75 80 85 

ctg gtg age ggg age aac gtg ctg gag acg gee gte ate cte ctg ctg 644 

Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala Val He Leu Leu Leu 

90 95 100 

gag gcc ggt gca ctg gtg gcc egg get gcg gtg ctg eag cag ctg gac 692 

Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val Leu Gin Gin Leu Asp 
105 110 115 

aat gte act gac gtg ate ace tgc age tec atg ctg tec age etc tgc 740 

Asn Val Thr Asp Val He Thr Cys Ser Ser Met Leu Ser Ser Leu Cys 
120 125 130 

ttc ctg ggc gcc ate gcc gtg gac cgc tac ate tec ate ttc tac gca 788 

Phe Leu Gly Ala He Ala Val Asp Arg Tyr He Ser He Phe Tyr Ala 
135 140 145 

ctg cgc tac cac age ate gtg ace ctg eeg egg gcg egg cga gee gtt 836 

Leu Arg Tyr His Ser He Val Thr Leu Pro Arg Ala Arg Arg Ala Val 
150 155 160 165 

gcg gcc ate tgg gtg gcc agt gte gte ttc age acg etc ttc ate gcc 884 

Ala Ala He Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe He Ala 

170 175 180 

tac tac gac cac gtg gcc gte ctg ctg tgc etc gtg gte ttc ttc ctg 932 

Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe Phe Leu 
185 190 195 

/ 

get atg ctg gtg cte atg gcc gtg ctg tac gte cac atg ctg gee egg 980 

Ala Met Leu Val Leu Met Tlla Val Leu Tyr Val His Met Leu Ala Arg 
200 205 210 

gcc tgc cag cac gcc cag ggc ate gcc egg etc cac aag agg cag cgc 1028 

Ala Cys Gin His Ala Gin Gly He Ala Arg Leu His Lys Arg Gin Arg 
215 220 225 

eeg gte cac cag ggc ttt ggc ctt aaa ggc get gte acc cte ace ate 1076 

Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala Val Thr Leu Thr He 
230 235 240 245 

ctg ctg ggc att ttc ttc etc tgc tgg ggc ccc ttc ttc ctg cat etc 1124 

Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro Phe Phe Leu His Leu 

250 255 260 

aca etc ate gte etc tgc ccc gag cac ccc acg tgc ggc tgc ate ttc 1172 

Thr Leu He Val Leu Cys Pro Glu His Pro Thr Cys Gly Cys He Phe 

265 270 275 

aag aac ttc aac etc ttt etc gcc etc ate ate tgc aat gcc ate ate 1220 

Lys Asn Phe Asn Leu Phe Leu Ala Leu He He Cys Asn Ala He He 
280 285 290 

gac ccc etc ate tac gcc ttc cac age cag gag etc cgc agg acg etc 1268 

Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu Leu Arg Arg Thr Leu 
295 300 305 

aag gag gtg ctg aca tgc tec tgc tet cag gac egt gee etc gte age 1316 

Lys Glu Val Leu Thr Cys Ser Cys Ser Gin Asp Arg Ala Leu Val Ser 

310 315 320 325 

tgg gat gtg aag tet ctg ggt gga agt gtg tgc caa gag eta etc cea 1364 

Trp Asp Val Lys Ser Leu Gly Gly Ser Val Cys Gin Glu Leu Leu Pro 

330 335 340 

eag eag ccc eag gag aag ggg ctt tgt gac cag aaa get tea tee aca 1412 
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Gin Gin Pro Gin Glu Lys Gly Leu Cys Asp Gin Lys Ala Ser Ser Thr 
345 350 355 

gcc ttg cag egg etc ctg caa aag gag cct aga gga agg acg age agg 1460 
Ala Leu Gin Arg Leu Leu Gin Lys Glu Pro Arg Gly Arg Thr Ser Arg 
360 365 370 

tgc age agg gcc cca gtc ccc tec act ctt gac get gtc eta get gca 1508 
Cys Ser Arg Ala Pro Val Pro Ser Thr Leu Asp Ala Val Leu Ala Ala 
375 380 385 

gaa gag gcg ggt tec cag cct tec ctg tgaccacatg tgaectcagc 1555 
Glu Glu Ala Gly Ser Gin Pro Ser Leu 
390 395 
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<210> 26 

<211> 398 

<212> PRT 

<213> Homo sapiens 

<40D> 26 

Met Ala Val Gin Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser 
15 10 15 

Thr Pro Thr Ala lie Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 



Ala Arg Cys Leu Glu Val Ser lie Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 



Gly Leu val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr lie Ala 
50 55 60 



Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe lie Cys Cys Leu 
65 70 75 80 

Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala 
85 90 95 

Val He Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 



Leu Gin Gin Leu Asp Asn Val Thr Asp Val He Thr Cys Ser Ser Met 
115 120 125 



Leu Ser Ser Leu Cys Phe Leu Gly Ala He Ala Val Asp Arg Tyr He 
130 135 140 



Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 



Ala Arg Arg Ala Val Ala Ala He Trp Val Ala Ser Val Val Phe Ser 
165 170 175 



Thr Leu Phe He Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu 
180 185 190 



Val. val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val 
195 200 205 



His Met Leu Ala Arg Ala Cys Gin His Ala Gin Gly He Ala Arg Leu 
210 215 220 



His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 
225 230 235 240 



Val Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro 
245 250 255 



Phe Phe Leu His Leu Thr Leu He Val Leu Cys Pro Glu His Pro Thr 
260 265 270 

Cys Gly Cys He Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu He He 
275 280 285 
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Cys Asn Ala lie lie Asp Pro Leu lie Tyr Ala Phe His Ser Gin Glu 
290 295 300 

Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Cys Ser Gin Asp 
305 310 315 320 

Arg Ala Leu Val Ser Trp Asp Val Lys Ser Leu Gly Gly Ser Val Cys 
325 330 335 

Gin Glu Leu Leu Pro Gin Gin Pro Gin Glu Lys Gly Leu Cys Asp Gin 
340 345 350 

Lys Ala Ser Ser Thr Ala Leu Gin Arg Leu Leu Gin Lys Glu Pro Arg 
355 360 365 



Gly Arg Thr Ser Arg Cys Ser Arg Ala Pro Val Pro Ser Thr Leu Asp 
370 375 380 



Ala Val Leu Ala Ala Glu Glu Ala Gly Ser Gin Pro Ser Leu 
390 395 



385 




<210> 


27 


<211> 


398 


<212> 


PRT 


<213> 


Homo 


<400> 


27 


Met Ala Val 


1 





10 15 



Thr Pro Thr Ala lie Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 



Ala Arg Cys Leu Glu Val Ser lie Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 



Gly Leu Val Ser Leu val Glu Asn Ala Leu val Val Ala Thr He Ala 
50 55 60 



Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe He Cys Cys Leu 
65 70 75 80 



Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala 
85 90 95 



Val He Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 



Leu Gin Gin Leu Asp Asn Val Thr Asp Val He Thr Cys Ser Ser Met 
115 120 125 



Leu Ser Ser Leu Cys Phe Leu Gly Ala He Ala Val Asp Arg Tyr He 
130 135 140 



Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 



Ala Arg Gin Ala Val Ala Ala He Trp Val Ala Ser Val Val Phe Ser 
165 170 175 
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Thr Leu Phe lie Ala Tyr Tyr Asp His VaX Ala Val Leu Leu Cys Leu 
180 185 190 

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val 
195 200 205 



His Met Leu Ala Arg Ala Cys Gin His Ala Gin Gly He Ala Arg Leu 
210 215 220 



His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 
225 230 235 240 



Val Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro 
245 250 255 



Phe Phe Leu His Leu Thr Leu He Val Leu Cys Pro Glu His Pro Thr 
260 265 270 



Cys Gly Cys He Phe Lys Asn Phe Asn I>eu Phe Leu Ala Leu He He 
275 280 285 



Cys Asn Ala He He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu 
290 295 300 



Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Cys Ser Gin Asp 
305 310 315 320 



Arg Ala Leu Val Ser Trp Asp Val Lys Ser Leu Gly Gly Ser Val Cys 
325 330 335 



Gin Glu Leu Leu Pro Gin Gin Pro Gin Glu Lys Gly Leu Cys Asp Gin 
340 345 350 



Lys Ala Ser Ser Thr Ala Leu Gin Arg Leu Leu Gin Lys Glu Pro Arg 
355 360 365 



Gly Arg Thr Ser Arg Cys Ser Arg Ala Pro Val Pro Ser Thr Leu Asp 
370 375 380 



Ala Val Leu Ala Ala Glu Glu Ala Gly Ser Gin Pro Ser Leu 
385 390 395 



<210> 28 

<211> 398 

<212> PRT 

<213> Homo sapiens 

<400> 28 

Met Ala Val Gin Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser 
15 10 15 

Thr Pro Thr Ala He Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 

Ala Arg Cys Leu Glu Val Ser He Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 



Gly Leu Val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr He Ala 
50 55 60 
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Lys Asn Arg Asn lieu His Ser Pro Met Tyr Cys Phe lie Cys Cys Leu 
65 70 75 80 

Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala 
85 90 95 

Val lie Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 

Leu Gin Gin Leu Asp Asn Val lie Asp Val He Thr Cys Ser Ser Met 
115 120 125 

Leu Ser Ser I^u Cys Phe Leu Gly Ala lie Ala Val Asp Arg Tyr lie 
130 135 140 



Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 



Ala Arg Arg Ala Val Ala Ala He Trp val Ala Ser val val Phe Ser 
165 170 175 



Thr Leu Phe He Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu 

180 185 190 

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val 

195 200 205 



His Met Leu Ala Arg Ala Cya Gin His Ala Gin Gly He Ala Arg Leu 

210 215 220 

His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 

225 230 235 240 



Val Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro 
245 250 255 



Phe Phe Leu His Leu Thr Leu He Val Leu Cys Pro Glu His Pro Thr 
260 265 270 



Cys Gly Cys He Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu He He 
275 280 285 



Cys Asn Ala He He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu 
290 295 300 



Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Cys Ser Gin Asp 
305 310 315 320 



Arg Ala Leu Val Ser Trp Asp Val Lys Ser Leu Gly Gly Ser Val Cys 
325 330 335 



Gin Glu Leu Leu Pro Gin Gin Pro Gin Glu Lys Gly Leu Cys Asp Gin 
340 345 350 

Lys^Ala Ser Ser Thr Ala Leu Gin Arg Leu Leu Gin Lys Glu Pro Arg 
355 360 365 

Gly Arg Thr Ser Arg Cys Ser Arg Ala Pro Val Pro Ser Thr Leu Asp 

370 375 380 
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Ala Val Leu Ala Ala Glu Glu Ala Gly Ser Gin Pro Ser Leu 
385 390 395 

<210> 29 

<211> 398 

- <212> PRT 

<213> Homo sapiens 

<400> 29 

Met Ala Val Gin Gly Ser Gin Arg Arg Leu LeU Gly Ser Leu Asn Ser 
15 10 15 

Thr Pro Tbr Ala lie Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 

Ala Arg Cys Leu Glu Val Ser He Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 

Gly Leu val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr He Ala 
50 55 60 

Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe He Cys Cys Leu 
65 • 70 75 80 

Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala 
85 90 95 

Val He Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 

Leu Gin Gin Leu Asp Asn Val He Asp Val He Thr Cys Ser Ser Met 
115 120 125 

Leu Ser Ser Leu Cys Phe Leu Gly Ala He Ala Val Asp Arg Tyr He 
. 130 135 140 

Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 

• Ala Arg Gin Ala Val Ala Ala He Trp Val Ala Ser Val Val Phe Ser 
165 170 175 

Thr Leu Phe He Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu 
180 185 190 

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val 
195 200 205 

His Met Leu Ala Arg Ala Cys Gin His Ala Gin Gly He Ala Arg Leu 
210 215 220 

. His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 
225 230 235 240 

Val Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro 
245 250 255 

Phe Phe Leu His Leu Thr Leu He Val Leu Cys Pro Glu His Pro Thr 
260 265 270 
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Cys Gly Cys He Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu He He 
275 280 265 

Cys Asn Ala He He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu 
290 295 300 

Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Cys Ser Gin Asp 
305 310 315 320 

Arg Ala Leu Val Ser Trp Asp Val Lys Ser Leu Gly Gly Ser Val Cys 
325 330 335 

Gin Glu Leu Leu Pro Gin Gin Pro Gin Glu Lys Gly Leu Cys Asp Gin 
340 345 350 

Lys Ala Ser Ser Thr Ala Leu Gin Arg Leu Leu Gin Lys Glu Pro Arg 
355 360 365 

Gly Arg Thr Ser Arg Cys Ser Arg Ala Pro Val Pro Ser Thr Leu Asp 
370 375 380 

Ala Val Leu Ala Ala Glu Glu Ala Gly Ser Gin Pro Ser Leu 
385 390 395 



<210> 30 

<211> 317 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Ala val Gin Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser 
15 10 15 

Thr Pro Thr Ala He Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 



Ala Arg Cys Leu Glu Val Ser He Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 



Gly Leu Val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr He Ala 
50 55 60 

Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe He Cys Cys Leu 
65 70 75 80 

Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala 
85 90 95 



Val He Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 



Leu Gin Gin Leu Asp Asn Val He Asp Val He Thr Cys Ser Ser Met 
115 120 125 

Leu Ser Ser Leu Cys Phe Leu Gly Ala He Ala Val Asp Arg Tyr He 
130 135 140 



Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 
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Ala Arg Gin Ala Val Ala Ala lie Trp Val Ala Ser Val Val Phe Ser 
165 170 175 



Thr Leu Phe lie Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu 
180 185 190 



Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val 
195 200 205 



His Met Leu Ala Arg Ala Cys Gin His Ala Gin Gly lie Ala Arg Leu 
210 215 220 



His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 
225 230 235 240 



Val Thr Leu Thr lie Leu Leu Gly lie Phe Phe Leu Cys Trp Gly Pro 
245 250 255 



Phe Phe Leu His Leu Thr Leu lie Val Leu Cys Pro Glu His Pro Thr 
260 265 270 



Cys Gly Cys lie Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu He He 
275 280 285 



Cys Asn Ala He He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu 
290 295 300 



Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Trp 
305 310 315 



<210> 31 

<211> 382 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Met Ala Val Gin Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser 
15 10 15 



Thr Pro Thr Ala He Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 



Ala Arg Cys Leu Glu Val Ser He Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 

Gly Leu Val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr He Ala 
50 55 60 



Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe He Cys Cys Leu 
65 70 75 80 

Ala Leu Ser Asp Leu Leu Val Ser Gly Ser Asn Val Leu Glu Thr Ala 
85 90 95 

Val He Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 

Leu Gin Gin Leu Asp Asn Val He Asp Val He Thr Cys Ser Ser Met 
115 120 125 
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Leu Ser Ser Leu Cys Phe Leu Gly Ala xXe Ala Val Asp Arg Tyr He 
130 135 140 

Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 

Ala Arg Arg Ala Val Ala Ala He Trp val Ala Ser Val Val Phe Ser 
165 170 175 

Thr Leu Phe He Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu 
180 185 190 

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val 
195 200 205 

His Met Leu Ala Arg Ala Cys Gin His Ala Gin Gly He Ala Arg Leu 
210 215 220 

His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 
225 230 235 240 

Val Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro 
245 250 255 

Phe Phe Leu His Leu Thr Leu He Val Leu Cys Pro Glu His Pro Thr 
260 265 270 

Cys Gly Cys He Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu He He 
275 280 285 

Cys Asn Pila He He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu 
290 295 300 

Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Cys Ser Gin Asp 
305 310 315 320 

Arg Ala Leu Val Ser Trp Asp Val Lys Ser Leu Gly Gly Ser Val Cys 
325 330 335 

Gin Glu Leu Leu Pro Gin Gin Pro Gin Glu Lys Gly Leu Cys Asp Gin 
340 345 350 

Lys Ala Ser Ser Thr Ala Leu Gin Arg Leu Leu Gin Lys Glu Val Lys 
355 360 365 

Ser Leu Pro Gin Ala Lys Gly Pro Gly Leu Gin Glu Pro Pro 
370 375 380 

<210> 32 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 32 

cctcatcatc tgcaatgcca tc 22 

<210> 33 

<211> 22 

<212> DNA 

<213> Homo sapiens 
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<400> 33 

gctcgtcctt cctctaggct cc 



<210> 34 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 34 

aggaagcagc tcccagttct ga 



<210> 35 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 35 

cttccgcagc ggaaatggcg cgccgcccgg ggagggcggg agcagcgtcc 



<210> 36 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 36 

cctcaggctc tacaagatgc ctgaaaacac caacctctcc agggctcact 



<210> 37 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 37 

aacgactttt taaaacgcag agaaaagctc cattcttccc aggacctcag 



<210> 38 

<211> 7062 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (166) (5288) 
<223> 

<400> 38 

cggtcgcagc gcacagagct tgctggccag ggaggagcta gtctccgtgg gcgccgccgc 

cgccccagcc tgcgcgcctc tctcctggcg cgcgccagtc tggcactctg ggagctgggt 

cctagcacca cagacttatc cttcgcctgc actttccgtc tttcttctct gggcgcccac 

caaca atg gat ggc aac tec ctg etc teg gta cca age aac ttg gag tea 
Met Asp Gly Asn Ser Leu Leu Ser Val Pro Ser Aan Leu Glu Ser 
15 10 15 

tea egg atg tat gac gtt ttg gaa ecg cag cag ggc aga ggc tgt ggc 
Ser Arg Met Tyr Asp Val Leu Glu Pro Gin Gin Gly Arg Gly Cya Gly 
20 25 30 

age tea gga age ggc ceg ggg aac tee ate aca gcc tgt aag aag gtt 
Ser Ser Gly Ser Gly Pro Gly Asn Ser lie Thr Ala Cys Lys Lys Val 
35 40 45 

ctt cge age aat ago ctg ctg gag tea aca gac tac tgg ttg cag aat 
Leu Arg Ser Asn Ser Leu Leu Glu Ser Thr Asp Tyr Trp Leu Gin Asn 
50 55 60 

cag agg atg ecc tge caa att ggt ttt gta gaa gac aag tet gaa aac 
Gin Arg Met Pro Cys Gin lie Gly Phe Val Glu Asp Lys Ser Glu Asn 
65 70 75 

tgt get tet gtc tgc ttt gtg aat ctt gat gtg aac aag gat gaa tge 
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Cys Ala Ser Val Cys Phe Val Asn Leu Asp Val Asn Lys Asp Glu Cys 
80 85 90 95 

age aca gag cac ctg caa cag aaa ctg gtc aac gtt tea cca gat ctt 518 
Ser Tlir Glu His Leu Gin Gin Lys Leu Val Asn Val Ser Pro Asp Leu 
100 105 110 

cca aaa ctt ate agt tec atg aat gtc caa caa cca aaa gaa aat gaa 566 
Pro Lys Leu lie Ser Ser Met Asn Val Gin Gin Pro Lys Glu Asn Glu 
115 120 125 

att gtt gtc eta agt ggg tta gcc tct gga aat etc cag gca gat ttt 614 
lie Val Val Leu Ser Gly Leu Ala Ser Gly Asn Leu Gin Ala Asp Phe 
130 135 140 

gaa gtt tea cag tgc cct tgg ctg cca gat ate tgc ttg gtc caa tgt 662 
Glu Val Ser Gin Cys Pro Trp Leu Pro Asp He Cys Leu val Gin Cys 
145 150 155 

gca aga ggg aac aga cca aac agt acc aac tgc ate ate ttt gaa ate 710 
Ala Arg Gly Asn Arg Pro Asn Ser Thr Asn Cys He He Phe Glu He 
160 165 170 175 

aac aaa ttt ctg att ggt ctg gaa ctg gtg cag gag ega cag etc cac 758 
Asn Lys Phe Leu He Gly Leu Glu Leu Val Gin Glu Arg Gin Leu His 
180 185 190 

ctg gaa aca aac ate ttg aaa ctg gag gat gac acg aac tgt tee tta 806 
Leu Glu Thr Asn He Leu Lys Leu Glu Asp Asp Thr Asn Cys Ser Leu 
195 200 205 

tct tea ate gag gaa gac ttt etc acc get tct gag cac ttg gag gag 854 
Ser Ser He Glu Glu Asp Phe Leu Thr Ala Ser Glu His Leu Glu Glu 
210 215 220 

gaa age gag gtg gat gaa tct agg aac gat tat gaa aat ata aat gtc 902 
Glu Ser Glu Val Asp Glu Ser Arg Asn Asp Tyr Glu Asn He Asn Val 

225 230 235 

tea gcc aat gtt ttg gaa agt aaa cag eta aag gga gee acc cag gtg 950 
Ser Ala Asn Val Leu Glu Ser Lys Gin Leu Lys Gly Ala Thr Gin Val 
240 245 250 255 

gaa tgg aat tgc aac aag gaa aag tgg ctt tat get ttg gaa gac aaa 998 
Glu Trp Asn Cys Asn Lys Glu Lys Trp Leu Tyr Ala Leu Glu Asp Lys 
260 265 270 ' 

tac ate aac aaa tat ece aca eca ttg att aaa aca gaa cga tct cca 1046 
Tyr He Asn Lys Tyr Pro Thr Pro Leu He Lys Thr Glu Arg Ser Pro 
275 280 285 

gaa aac eta aca aag aac aca gee ttg cag agt eta gat cec tea gee 1094 
Glu Asn Leu Thr Lys Asn Thr Ala Leu Gin Ser Leu Asp Pro Ser Ala 
290 295 300 

aag cca tea cag tgg aaa aga gaa get gtg ggg aat ggg aga caa gee 1142 
Lys Pro Ser Gin Trp Lys Arg Glu Ala Val Gly Asn Gly Arg Gin Ala 
305 310 315 

aca cat tat tat cat tea gaa get ttt aaa ggt caa atg gaa aaa tea 1190 
Thr His Tyr Tyr His Ser Glu Ala Phe Lys Gly Gin Met Glu Lys Ser 
320 325 330 335 

cag gca ctg tat att cca aaa gat get tat tte tee atg atg gat aaa 1238 
Gin Ala Leu Tyr He Pro Lys Asp Ala Tyr Phe Ser Met Met Asp Lys 
340 345 350 

gat gta cct tct gca tgt get gtg gca gag cag aga age aac eta aac 1286 
Asp Val Pro Ser Ala Cys Ala Val Ala Glu Gin Arg Ser Asn Leu Asn 
355 360 365 > 

eca gga gac cat gaa gac aca aga aac get etc cct cct aga caa gat 1334 
Pro Gly Asp His Glu Asp Thr Arg Asn Ala Leu Pro Pro Arg Gin Asp 
370 375 380 

gga gaa gtc ace act ggc aag tat get aca aat tta gca gaa tec gtg 1382 
Gly Glu Val Thr Thr Gly Lys Tyr Ala Thr Asn Leu 7U.a Glu Ser Val 
385 390 395 
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ctg cag gat gca ttt att aga tta tct caa tct cag tec aca tta ccc 1430 
Leu Gin Asp Ala Phe lie Arg Leu Ser Gin Ser Gin Ser Thr Leu Pro 
400 405 410 415 

cag gaa tct gca gtc agt gtt tct gta gga agt tct ctg ctt ccc agt 1478 
Gin Glu Ser Ala Val Ser Val Ser Val Gly Ser Ser Leu Leu Pro Ser 
420 425 430 

tgc tat tec aca aaa gat aca gtg gtt tct egg tea tgg aat gag etc 1526 
Cys Tyr Ser Thr Lys Asp Thr Val Val Ser Arg Ser Trp Asn Glu Leu 
435 440 445 

ccc aaa ate gtc gtt gtt cag agt cca gat ggc agt gat get gee eca 1574 
Pro Lys lie Val Val Val Gin Ser Pro Asp Gly Ser Asp Ala Ala Pro 
450 455 460 

cag cca ggc ate tec tec tgg cet gag atg gaa gtc tct gtt gaa ace 1622 
Gin Pro Gly lie Ser Ser Trp Pro Glu Met Glu Val Ser Val Glu Thr 
465 470 475 

tea age ate etc tct gga gag aae tec age aga caa ccc cag agt get 1670 
Ser Ser lie Leu Ser Gly Glu Asn Ser Ser Arg Gin Pro Gin Ser Ala 
480 485 490 495 

eta gaa gtg geg tta get tgt gea gee act gtg att gga act att tec 1718 
Leu Glu val Ala Leu Ala Cys Ala Ala Thr Val lie Gly Thr lie Ser 
500 505 510 

agt eca cag gee aca gaa aga etc aaa atg gag caa gtg gtc teg aac 1766 
Ser Pro Gin Ala Thr Glu Arg Leu Lys Met Glu Gin Val Val Ser Asn 
515 520 525 

ttt ccc cca ggg age agt ggt gea ctg caa act caa gea ccc caa gga 1814 
Phe Pro Pro Gly Ser Ser Gly Ala Leu Gin Thr Gin Ala Pro Gin Gly 
530 535 540 

etc aag gaa cct tec ate aat gag tac tec ttt cca tct get ttg tgt 18 62 

Leu Lys Glu Pro Ser lie Asn Glu Tyr Ser Phe Pro Ser Ala Leu Cys 
545 550 555 

ggc atg act cag gtg gee agt gee gtg get gtc tgt ggt ctg ggt gaa 1910 
Gly Met Thr Gin Val Ala Ser Ala Val Ala Val Cys Gly Leu Gly Glu 
560 565 570 575 

aga gaa gag gtg aca tgc tea gtg get cca agt ggt age etc ceg cct 1958 
Arg Glu Glu Val Thr Cys Ser Val Ala Pro Ser Gly Ser Leu Pro Pro 
580 585 590 

gca get gag get tct gaa gee atg ccc cca ctt tgt ggt tta gca age 2006 
Ala Ala Glu Ala Ser Glu Ala Met Pro Pro Leu Cys Gly Leu Ala Ser 
595 600 605 

atg gag ctt ggc aag gaa gee att gee gag gga ttg etc aag gag get 2054 
Met Glu Leu Gly Lys Glu Ala lie Ala Glu Gly Leu Leu Lys Glu Ala 
610 615 620 

get ctg gtt tta aca agg cct aat ace tac age age att gga gae ttt 2102 
Ala Leu Val Leu Thr Arg Pro Asn Thr Tyr Ser Ser He Gly Asp Phe 
625 630 635 

ctg gae tee atg aae agg aga ate atg gaa act get tea aag tct cag 2150 
Leu Asp Ser Met Asn Arg Arg He Met Glu Thr Ala Ser Lys Ser Gin 
640 645 650 655 

ace ctg tgc tea gaa aat gtc gtc agg aat gaa ctg gca cat ace ctg 2198 
Thr Leu Cys Ser Glu Asn Val Val Arg Asn Glu Leu Ala His Thr Leu 
660 665 670 

tec aat gtt ate ctg agg cat tec att gat gaa gtt cae cac aaa aat 2246 
Ser Asn Val He Leu Arg His Ser He Asp Glu Val His His Lys Asn 
675 680 685 

atg ata ate gae ccc aat gae aac agg cat tea tct gaa att ctg gae 2294 
Met He He Asp Pro Asn Asp Asn Arg His Ser Ser Glu He Leu Asp 
690 695 700 

ace tta atg gaa agt aca aat caa ctg ctt tta gat gtg ata tgc tte 2342 
Thr Leu Met Glu Ser Thr Asn Gin Leu Leu Leu Asp Val He Cys Phe 
705 710 715 
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acg ttc aag aag atg agl: cat att gta egg ctt ggt gaa tgt cct get 2390 
Thr Pbe Lys I*ys Met Ser His He Val Arg Leu Gly Glu Cys Pro Ala 
720 725 730 735 

gtc ctt tct aag gag acc ate aga agg agg gag aca gaa cca age tgc 2438 
Val Leu Ser Lys Glu Thr He Arg Arg Arg Glu Thr Glu Pro Ser Cys 
740 745 750 

cag cca tct gat ccg ggt get agt eaa get tgg aca aaa gee act gaa 2486 
Gin Pro Ser Asp Pro Gly Ala Ser Gin Ala Trp Thr Lys Ala Thr Glu 
755 760 765 

tec tec age age tct cca ctt age aat tea cac aac acg agt ctt gtc 2534 
Ser Ser Ser Ser Ser Pro Leu Ser Asn Ser His Asn Thr Ser Leu Val 
770 775 780 

ate aac aat ctt gtg gat ggc atg tat tea aaa eaa gae aag ggt gga 2582 
He Asn Asn Leu Val Asp Gly Met Tyr Ser Lys Gin Asp Lys Gly Gly 
785 790 795 

gtg agg cca ggc etc ttc aag aac ecc acg ctg eag tea eaa tta tea 2630 
Val Arg Pro Gly Leu Phe Lys Asn Pro Thr Leu Gin Ser Gin Leu Ser 
800 805 810 815 

egt agt cae aga gtg ecc gat tct tea act get aca aca tee tee aag 2678 
Arg Ser His Arg Val Pro Asp Ser Ser Thr Ala Thr Thr Ser Ser Lys 
820 825 830 

gaa ata tat ctg aaa gga ata gea gga gag gat aca aaa age cct cat 2726 
Glu He Tyr Leu Lys Gly He Ala Gly Glu Asp Thr Lys Ser Pro His 
835 840 845 

cac agt gag aat gaa tgc aga gee tct tec gaa gga eaa agg tec cca 2774 
His Ser Glu Asn Glu Cys Arg Ala Ser Ser Glu Gly Gin Arg Ser Pro 
850 855 860 

acg gtc age egg tec aga agt ggt tec cag gag get gag gag agt ate 2822 
Thr Val Ser Arg Ser Arg Ser Gly Ser Gin Glu Ala Glu Glu Ser He 
865 870 875 

cac cca aac acc eaa gaa aag tac aac tgt gee aca tct cgc ate aac 2870 
His Pro Asn Thr Gin Glu Lys Tyr Asn Cys Ala Thr Ser Arg He Asn 
880 885 890 895 

gaa gtt caa gtc aac ctg tec ttg tta ggg gat gac ctg ctg ctt cct 2918 
Glii Val Gin Val Asn Leu Ser Leu Leu Gly Asp Asp Leu Leu Leu Pro 
900 905 910 

get caa tec acg ctt caa aca aag cat cca gac ate tac tgc att aca 2966 
Ala Gin Ser Thr Leu Gin Thr Lys His Pro Asp He Tyr Cys He Thr 

915 920 925 

gac ttt gcg gaa gaa tta gea gac acg gtc gtc tee atg gea act gaa 3014 
Asp Phe Ala Glu Glu Leu Ala Asp Thr val val Ser Met Ala Thr Glu 
930 935 940 

att gea gcg att tgc ctt gac aac tec agt gga aaa eaa ecc tgg ttt 3062 
He Ala Ala He Cys Leu Asp Asn Ser Ser Gly Lys Gin Pro Trp Phe 
945 950 955 

tgt gea tgg aaa aga ggg agt gag ttt ctg atg aca ecc aac gta ecc 3110 
Cys Ala Trp Lys Arg Gly Ser Glu Phe Leu Met Thr Pro Asn val Pro 
960 965 970 975 

tgc ega tec ttg aag agg aag aaa gag age cag ggg age ggg ace get • 3158 
Cys Arg Ser Leu Lys Arg Lys Lys Glu Ser Gin Gly Ser Gly Thr Ala 
980 985 990 

gtg agg aaa cac aag cct ecc egg etc agt gag ate aag agg aag acg 3206 
Val Arg Lys His Lys Pro Pro Arg Leu Ser Glu He Lys Arg Lys Thr 
995 1000 1005 

gae gag cac cct gag ctt aaa gaa aag ctg atg aac agg gtt gtg .3251 
Asp Glu His Pro Glu Leu Lys Glu Lys Leu Met Asn Arg Val Val 
1010 1015 1020 

gat gag tec atg aac ctt gaa gat gtc cca gat tct gtc aat ctt 3296 
Asp Glu Ser Met Asn Leu Glu Asp Val Pro Asp Ser Val Asn Leu 
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ttt gcc aat gaa gtg gca gcc aag ate atg aac eta aeg gag ttc 
Phe Ala Asn Glu Val Ala Ala Lys lie Met Asn Leu Thr Glu Phe 
1040 1045 1050 

tct atg gtg gae ggc atg tgg cag gcg cag ggc tat ccc egg aat 
Ser Met Val Asp Gly Met Trp Gin Ala Gin Gly Tyr Pro Arg Asn 
1055 1060 1065 

egg tta ctg agt ggc gac agg tgg age egg ctg aag gcc tee age 
Arg Leu Leu Ser Gly Asp Arg Trp Ser Arg Leu Lys Ala Ser Ser 
1070 1075 1080 

tgc gaa age att ect gag gaa gac tec gag gcc agg gcc tat gtc 
Cys Glu Ser lie Pro Glu Glu Asp Ser Glu Ala Arg Ala Tyr Val 
1085 1090 1095 



3341 



3386 



3431 



3476 



aac age ctg ggc tta atg age aeg ctg age cag eeg gtc age agg 
Asn Ser Leu Gly Leu Met Ser Thr Leu Ser Gin Pro Val Ser Arg 
1100 1105 1110 

gee age tct gtc tec aag cag teg age tgt gag age ate ace gat 
Ala Ser Ser Val Ser Lys Gin Ser Ser Cys Glu Ser lie Thr Asp 
1115 1120 1125 



3521 



3566 



gag ttt tee agg ttc atg gtg aac 
Glu Phe Ser Arg Phe Met Val Asn 
1130 1135 



cag atg gaa aat gaa ggg aga 
Gin Met Glu Asn Glu Gly Arg 
1140 



gga ttt gag tta ctg ctg gat tae tat get ggc aag aac gcc age 
Gly Phe Glu Leu Leu Leu Asp Tyr Tyr Ala Gly Lys Asn Ala Ser 
1145 1150 1155 



age att ctg aac tea gcc atg caa 
Ser lie Leu Asn Ser Ala Met Gin 
1160 1165 



cag gcg tgc egg aaa agt gac 
Gin Ala Cys Arg Lys Ser Asp 
1170 



cac etc agt gtg agg ect age tgt ccc tct aag eag tec age aca 
His Leu Ser Val Arg Pro Ser Cys Pro Ser Lys Gin Ser Ser Thr 
1175 1180 1185 



3611 



3656 



3701 



3746 



gag age ate 
Glu Ser lie 

1190 



act gag gag ttc tae 
Thr Glu Glu Phe Tyr 
1195 



agg tae atg ctg agg gac ate 
Arg Tyr Met Leu Arg Asp lie 
1200 



3791 



gaa aga gac age aga gaa agt gcc tec tee aga egg age age cag 
Glu Arg Asp Ser Arg Glu Ser Ala Ser Ser Arg Arg Ser Ser Gin 
1205 1210 1215 



3836 



gat tgg aca gcc ggc ctg ctg tct ect tct ctg ega tee cca gtg 

Asp Trp Thr Ala Gly Leu Leu Ser Pro Ser Leu Arg Ser Pro Val 
1220 1225 1230 

tgc cac aga cag teg tec atg cca gac age aga tec cca tgc tec 

Cys His Arg Gin Ser Ser Met Pro Asp Ser Arg Ser Pro Cys Ser 
1235 1240 1245 



3881 



3926 



agg ctg aca gtg aat gtg ccc ate aaa gcc aac tct tta gat ggc 
Arg Leu Thr Val Asn Val Pro He Lys Ala Asn Ser Leu Asp Gly 
1250 1255 1260 



ttt get cag aac tgc cca caa gat 
Phe Ala Gin Asn Cys Pro Gin Asp 
1265 1270 



ttc eta age gtg cag ccg gtc 
Phe Leu Ser Val Gin Pro Val 
1275 



3971 



4016 



agt age gcg tee tea tec ggt etc tgc aaa tct gae tct tgc ttg 
Ser Ser Ala Ser Ser Ser Gly Leu Cys Lys Ser Asp Ser Cys Leu 
1280 1285 1290 



4061 



tat egg aga ggt ggg act gae cac ate acc aac atg tta att cat 
Tyr Arg Arg Gly Gly Thr Asp His He Thr Asn Met Leu He His 
1295 1300 1305 



4106 



gaa aeg tgg get age tec att gag get etc atg egc aag aac aaa 
Glu Thr Trp Ala Ser Ser He Glu Ala Leu Met Arg Lys Asn Lys 
1310 1315 1320 



4151 



ate att gtg gat gat gca gag gaa get gac act gag ect gtt tct 
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lie lie Val Asp Asp Ala Glu Glu Ala Asp Thr Glu Pro Val Ser 
1325 1330 1335 



gcft ggc tct ccc teg caa gca gag 
Gly Gly Ser Pro Ser Gin Ala Glu 
1340 1345 



aag tgt gca aat aga tta get 
Lys Cys Ala Asn Arg Leu Ala 
1350 



gcg age agg atg tgc agt ggg cca act ctg ctt gtt cag gag tct 
Ala Ser Arg Met Cys Ser Gly Pro Thr Leu Leu Val Gin Glu Ser 
1355 1360 1365 



etc gat tgc 
Leu Asp Cys 
1370 



ccg agg aaa gac tct 
Pro Arg Lys Asp Ser 
1375 



gtt acc gaa tgt aaa cag ccc 
Val Thr Glu Cys Lys Gin Pro 
1380 



cca gtg tea tct ttg age aaa act get tct ctt aca aac cac age 
Pro Val Ser Ser Leu Ser Lys Thr Ala Ser Leu Thr Asn His Ser 
1385 1390 1395 



cct tta gat tct aaa aaa gaa act 
Pro Leu Asp Ser Lys Lys Glu Thr 
1400 1405 



tec teg tgc cag gac cct gta 
Ser Ser Cys Gin Asp Pro Val 
1410 



cca ata aac cac aaa agg cga tea ctt tgc teg agg gaa gtg cct 

Pro He Asn His Lys Arg Arg Ser Leu Cys Ser Arg Glu Val Pro 
1415 1420 1425 

ttg att cag att gaa aca gat cag aga gaa gee tgt get ggg gaa 

Leu He Gin He Glu Thr Asp Gin Arg Glu Ala Cys Ala Gly Glu 
1430 1435 1440 



4241 



4286 



4331 



4376 



4421 



4466 



4511 



cct gaa ccc ttc ctt tec aaa age age etc eta gag gaa gca gaa 
Pro Glu Pro Phe Leu Ser Lys Ser Ser Leu Leu Glu Glu Ala Glu 
1445 1450 1455 



4556 



ggg cat teg aat gac aaa aac ate cca gat gtg gtg aga ggt gga 
Gly His Ser Asn Asp Lys Asn He Pro Asp Val Val Arg Gly Gly 
1460 1465 1470 



4601 



gac aca gee gtg age get tgt caa ate cat agt gac age 
Asp Thr Ala Val Ser Ala Cys Gin He His Ser Asp Ser 
1475 1480 1485 



acc aga gat gta cca gag get gaa 
Thr Arg Asp Val Pro Glu Ala Glu 
1490 1495 



gee tec aca gaa gee 
Ala Ser Thr Glu Ala 
1500 



ctt gat 
Leu Asp 



aga gee 
Arg Ala 



4646 



4691 



ccc gat gag gcc ccc aac cct cca 
Pro Asp Glu Ala Pro Asn Pro Pro 
1505 1510 



age age age gag gag age aca 
Ser Ser Ser Glu Glu Ser Thr 
1515 



4736 



ggc age tgg 
Gly Ser Trp 
1520 



acc cag ctt gcc aat 
Thr Gin Leu Ala Asn 
1525 



gag gaa gac aac cca gat gac 
Glu Glu Asp Asn Pro Asp Asp 
1530 



4781 



aca agt age ttt etc cag etc agt gag cga tec atg age aat ggc 
Thr Ser Ser Phe Leu Gin Leu Ser Glu Arg Ser Met Ser Asn Gly 
1535 1540 1545 



4826 



aac agt agt gee act age agt ctt 
Asn Ser Ser Ala Thr Ser Ser Leu 
1550 1555 



ggc att atg gat ctg gac att 
Gly He Met Asp Leu Asp He 
1560 



4871 



tat cag gaa age atg cca tct tct 
Tyr Gin Glu Ser Met Pro Ser Ser 
1565 1570 



ccc atg att aat gaa tta gta 
Pro Met He Asn Glu Leu Val 

1575 



4916 



gaa gaa aag aag att ctt aaa gga 
Glu Glu Lys Lys He Leu Lys Gly 
1580 1585 



cag tea gaa age aca gag gca 
Gin Ser Glu Ser Thr Glu Ala 
1590 



4961 



cct gca tct gga ccg cct acg gga aca gee age ccc cag agg age 
Pro Ala Ser Gly Pro Pro Thr Gly Thr Ala Ser Pro Gin Arg Ser 
1595 1600 1605 



5006 



ctg ctg gtg ate aac ttt gac ctg 
Leu Leu Val He Asn Phe Asp Leu 
1610 1615 



gag cca gag tgt cca gat gcc 
Glu Pro Glu Cys Pro Asp Ala 
1620 



5051 
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gag etc cga gcc act ctg cag tgg ata got gcc tct gaa ctg ggg 5096 
Glu Leu Arg Ala Thr Leu Gin Trp He Ala Ala Ser Glu Leu Gly 
1625 1630 1635 

att ccc acc ate tac ttt aag aaa tct cag gaa aac aga att gaa 5141 
He Pro Thr He Tyr Phe Lys Lys Ser Gin Glu Asn Arg He Glu 
1640 1645 1650 

aag ttt eta gat gtc gtg cag etg gtt cat egg aag tec tgg aaa 5186 
Lys Phe Leu Asp Val Val Gin Leu Val His Arg Lys Ser Trp Lys 
1655 1660 1665 

gtg ggt gat ate ttc cat gca gtt gtc cag tac tgc aaa atg cat 5231 
Val Gly Asp He Phe His Ala Val Val Gin Tyr Cys Lys Met His 
1670 1675 1680 

gag gag cag aag gat ggg aga ctg agt etc ttt gac tgg etc ttg 5276 
Glu Glu Gin Lys Asp Gly Arg Leu Ser Leu Phe Asp Trp Leu Leu 
1685 1690 1695 

gaa ctg gga taa taaggcagtc tgccgtatag atcattcctt ccctttattc 5328 
Glu Leu Gly 
1700 



caacttagat 


tacagtggtt 


tgttctaaat 


gctctaaaea 


ttetcaaaac 


ateacatcac 


5388 


attageagaa 


etataaaaaa 


aaatctgeta 


etcagatcca 


ctgeatacag 


aataagteag 


5448 


aggaaaagea 


aaatataggt 


ctgtccaaat 


tcatacaact 


tgtgggtgag 


ttccaaagag 


5508 


ettggattag 


aagggctgga 


caaagagaga 


attcaatggg 


gcccaaatta 


gaatgcttat 


5568 


aatgagaccc 


aatctccagg 


aaaacaacac 


tcacataagt 


ttaatcatat 


aaaatgattt 


5628 


gtaatgtctc 


taattagatg 


aatcaactag 


aaacaaactc 


agtggtcaaa 


ataattttta 


5688 


agagtattcc 


gtaacctata 


ttttactttt 


ctgattatat 


taaggggetg 


ccagcccgga 


5748 


gaaatactta 


agatatgggt 


gagaaateec 


cagactttta 


tacaaaagat 


ttccactttc 


5808 


aaatcaatgt 


cagtagaeat 


tgataaaagt 


atagcagcat 


cctetaetga 


ggtgatttca 


5868 


tttattccet 


geageecaet 


gataaatate 


teacttctec 


caaatagtat 


gtggactccc 


5928 


agctaagcag 


aaaactattg 


tcattcaact 


gaagaggaag 


ataaaagatt 


gtcttgtttc 


5988 


catcactgta 


ttacttgtgt 


aacatgatta 


cataattctt 


atcctaagag 


aaagctttca 


6048 


tatttaaaaa 


aaagtctttt 


cagataaaat 


ctgcttgtgt 


cttgaataat 


atgaaataca 


6108 


aactttcact 


ttattttatt 


gtaaattata 


aagagattat 


tgtcttaaat 


aatatattga 


6168 


gttagcttea 


agcttcctaa 


aatatgaaga 


gattgttgtc 


taaagtcaca 


tattgacatt 


6228 


gagctcagtg 


gcctgtttca 


tcacgtatgt 


gctgctacct 


gtacageaga 


catgecgetc 


6288 


eagtgacatt 


tataatgaca 


gaagcagggt 


aatggtcttg 


tgtttgaeat 


gatcagttag 


6348 


gatcatagac 


tttccctgac 


tcgtagatat 


tagccttgaa 


ttgggggaaa 


agaagacttt 


6408 


gacacatttt 


agttatttta 


ataacagaga 


tttactcttt 


tgaaaaataa 


aggtatctaa 


6468 


tgtctcccta 


ataagtcttc 


tttccttcca 


actaaatgae 


ctacacggac 


ttttattttc 


6528 


ttgatcaaag 


aggtgtttat 


taaggacttc 


tggataacta 


tacttttact 


ctatttttaa 


6588 


agatcacaaa 


gtaattttaa 


atgtgaacag 


gtt ccc at ae 


catgaatget 


ggcctcaect 


6648 


tctctatcat 


ceacattttg 


aaatgeaaag 


aaagctcect 


tgtaagccat 


acttccttcc 


6708 


ecactcccat 


ectaggatac 


ttgcccagtg 


ctcattagge 


atttcttatt 


cagatagtcc 


6768 


aaatttaggt 


tattatgctt 


aatttgacac 


attaactaaa 


tgcccagttt 


taaaatatat 


6828 


ccatcaattc 


acgctgaaat 


gtgcttcttt 


gtgctatcaa 


atggaataga 


atacacttat 


6888 


tttttaaaca 


atcccagaat 


actgtgtgta 


gacttttgtt 


gtgctcaaat 


aaatgtttac 


6948 


ttatcttaea 


aagctcaaat 


actggattgt 


aaccatgtga 


tgaagttate 
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ctaacattgc aaatHaatca ataaatctct gttgtcaaaa aaaaaaaaaa aaaa 7062 

<210> 39 

<211> 1700 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Met Asp Gly Asn Ser Leu Leu Ser Val Pro Ser Asn Leu Glu Ser Ser 
15 10 15 

Arg Met Tyr Asp Val Leu Glu Pro Gin Gin Gly Arg Gly Cys Gly Ser 
20 25 30 

Ser Gly Ser Gly Pro Gly Asn Ser lie Thr Ala Cys Lys Lys Val Leu 
35 40 45 

Arg Ser Asn Ser Leu Leu Glu Ser Thr Asp Tyr Trp Leu Gin Asn Gin 
50 55 60 

Arg Met Pro Cys Gin lie Gly Phe Val Glu Asp Lys Ser Glu Asn Cys 
65 70 75 80 

Ala Ser Val Cys Phe Val Asn Leu Asp Val Asn Lys Asp Glu Cys Ser 
85 90 95 

Thr Glu His Leu Gin Gin Lys Leu Val Asn Val Ser Pro Asp Leu Pro 
100 105 110 

Lys Leu lie Ser Ser Met Asn Val Gin Gin Pro Lys Glu Asn Glu lie 
115 120 125 

Val Val Leu Ser Gly Leu Ala Ser Gly Asn Leu Gin Ala Asp Phe Glu 
130 135 140 

val Ser Gin Cys Pro Trp Leu Pro Asp lie Cys Leu Val Gin Cys Ala 
145 150 155 160 

Arg Gly Asn Arg Pro Asn Ser Thr Asn Cys lie lie Phe Glu He Asn 
165 170 175 

Lys Phe Leu He Gly Leu Glu Leu Val Gin Glu Arg Gin Leu His Leu 
180 185 190 

Glu Thr Asn He Leu Lys Leu Glu Asp Asp Thr Asn Cys Ser Leu Ser 
195 200 205 

Ser He Glu Glu Asp Phe Leu Thr Ala Ser Glu His Leu Glu Glu Glu 
210 215 220 

Ser Glu Val Asp Glu Ser Arg Asn Asp Tyr Glu Asn He Asn Val Ser 
225 230 235 240 

Ala Asn Val Leu Glu Ser Lys Gin Leu Lys Gly Ala Thr Gin Val Glu 
245 250 255 

Trp T^n Cys Asn Lys Glu Lys Trp Leu Tyr Ala Leu Glu Asp Lys Tyr 
260 265 270 
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He Asn Lys Tyr Pro Thr Pro lieu He I»ys Thr Glu Arg Ser Pro Glu 
275 280 285 



Asn Leu Thr Lys Asn Thr Ala Leu Gin Ser Leu Asp Pro Ser Ala Lys 
290 295 300 

Pro Ser Gin Trp Lys Arg Glu Ala Val Gly Asn Gly Arg Gin Ala Thr 
305 310 315 320 

His Tyr Tyr His Ser Glu Ala Phe Lys Gly Gin Met Glu Lys Ser Gin 
325 330 335 



Ala Leu Tyr He Pro Lys Asp Ala Tyr Phe Ser Met Met Asp Lys Asp 
340 345 350 



Val Pro Ser Ala Cys Ala Val Ala Glu Gin Arg Ser Asn Leu Asn Pro 
355 360 365 

Gly Asp His Glu Asp Thr Arg Asn Ala Leu Pro Pro Arg Gin Asp Gly 
370 375 380 



Glu Val Thr Thr Gly Lys Tyr Ala Thr Asn Leu Ala Glu Ser Val Leu 
385 390 395 400 

Gin Asp Ala Phe He Arg Leu Ser Gin Ser Gin Ser Thr Leu Pro Gin 
405 410 415 



Glu Ser Ala Val Ser Val Ser Val Gly Ser Ser Leu Leu Pro Ser Cys 
420 425 430 



Tyr Ser Thr Lys Asp Thr Val Val Ser Arg Ser Trp Asn Glu Leu Pro 
435 440 445 



Lys He Val Val Val Gin Ser Pro Asp Gly Ser Asp Ala Ala Pro Gin 
450 455 460 



Pro Gly He Ser Ser Trp Pro Glu Met Glu Val Ser Val Glu Thr Ser 
465 470 475 460 



Ser He Leu Ser Gly Glu Asn Ser Ser Arg Gin Pro Gin Ser Ala Leu 
485 490 495 



Glu Val Ala Leu Ala Cys Ala Ala Thr Val He Gly Thr He Ser Ser 
500 505 510 



Pro Gin Ala Thr Glu Arg Leu Lys Met Glu Gin Val Val Ser Asn Phe 
515 520 525 



Pro Pro Gly Ser Ser Gly Ala Leu Gin Thr Gin Ala Pro Gin Gly Leu 
530 535 540 



Lys Glu Pro Ser He Asn Glu Tyr Ser Phe Pro Ser Ala Leu Cys Gly 
545 550 555 560 



Met Thr Gin Val Ala Ser Ala Val Ala Val Cys Gly Leu Gly Glu Arg 
565 570 575 



Glu Glu Val Thr Cys Ser Val Ala Pro Ser Gly Ser Leu Pro Pro Ala 
580 585 590 



Page 49 



wo 03/085095 



PCTAJS03/09921 



160 100 PCT.ST25 

Ala Glu Ala Ser Glu Ala Met Fro Pro Leu Cys Gly I*eu Ala Ser Met 
595 600 605 

Glu Leu Gly Lys Glu Ala lie Ala Glu Gly Leu Leu Lys Glu Ala Ala 
610 615 620 

Leu Val Leu Thr Arg Pro Asn Thr Tyr Ser Ser He Gly Asp Phe Leu 
625 630 635 640 

Asp Ser Met Asn Arg Arg He Met Glu Thr Ala Ser Lys Ser Gin Thr 
645 650 655 

Leu Cys Ser Glu Asn Val Val Arg Asn Glu Leu Ala His Thr Leu Ser 
660 665 670 



Asn Val He Leu Arg His Ser He Asp Glu Val His His Lys Asn Met 
675 680 685 

He He Asp Pro Asn Asp Asn Arg His Ser Ser Glu He Leu Asp Thr 
690 695 700 

Leu Met Glu Ser Thr Asn Gin Leu Leu Leu Asp Val He Cys Phe Thr 
705 710 715 720 

Phe Lys Lys Met Ser His He Val Arg Leu Gly Glu Cys Pro Ala Val 
725 730 735 

Leu Ser Lys Glu Thr He Arg Arg Arg Glu Thr Glu Pro Ser Cys Gin 
740 745 750 



Pro Ser Asp Pro Gly Ala Ser Gin Ala Trp Thr Lys Ala Thr Glu Ser 
755 760 765 

Ser Ser Ser Ser Pro Leu Ser Asn Ser His Asn Thr Ser Leu Val He. 
770 775 780 

Asn Asn Leu Val Asp Gly Met Tyr Ser Lys Gin Asp Lys Gly Gly Val 
785 790 795 800 



Arg Pro Gly Leu Phe Lys Asn Pro Thr Leu Gin Ser Gin Leu Ser Arg 
805 810 815 



Ser His Arg Val Pro Asp Ser Ser Thr Ala Thr Thr Ser Ser Lys Glu 
820 825 830 



He Tyr Leu Lys Gly He TVla Gly Glu Asp Thr Lys Ser Pro His His 
835 840 845 



Ser Glu Asn Glu Cys Arg Ala Ser Ser Glu Gly Gin Arg Ser Pro Thr 
850 855 860 



Val Ser Arg Ser Arg Ser Gly Ser Gin Glu Ala Glu Glu Ser He His 
865 870 875 880 

Pro Asn Thr Gin Glu Lys Tyr Asn Cys Ala Thr Ser Arg He Asn Glu 
885 890 895 

Val Gin Val Asn Leu Ser Leu Leu Gly Asp Asp Leu Leu Leu Pro Ala 
900 905 910 



Page 50 



wo 03/085095 PCTAJS03/09921 



16U 100 PCT.ST25 

Gin Ser Thr Leu Gin Thr Lys His Pro Asp lie Tyr Cys lie Thr Asp 
915 920 925 

Phe Ala Glu Glu Leu Ala Asp Thr Val Val Ser Met Ala Thr Glu Zle 
930 935 940 

Ala Ala lie Cys Leu Asp Asn Ser Ser Gly Lys Gin Pro Trp Phe Cys 
945 950 955 960 

Ala Trp Lys Arg Gly Ser Glu Phe Leu Met Thr Pro Asn Val Pro Cys 
965 970 975 

Arg Ser Leu Lys Arg Lys Lys Glu Ser Gin Gly Ser Gly Thr Ala Val 
980 985 990 

Arg Lys His Lys Pro Pro Arg Leu Ser Glu lie Lys Arg Lys Thr Asp 
995 1000 1005 

Glu His Pro Glu Leu Lys Glu Lys Leu Met Asn Arg Val Val Asp 
1010 1015 1020 

Glu Ser Met Asn Leu Glu Asp Val Pro Asp Ser Val Asn Leu Phe 
1025 1030 1035 

Ala Asn Glu Val Ala Ala Lys lie Met Asn Leu Thr Glu Phe Ser 
1040 1045 1050 

Met Val Asp Gly Met Trp Gin Ala Gin Gly Tyr Pro Arg Asn Arg 
1055 1060 1065 

Leu Leu Ser Gly Asp Arg Trp Ser Arg Leu Lys Ala Ser Ser Cys 
1070 1075 1080 

Glu Ser He Pro Glu Glu Asp Ser Glu Ala Arg Ala Tyr Val Asn 
1085 1090 1095 

Ser Leu Gly Leu Met Ser Thr Leu Ser Gin Pro Val Ser Arg Ala 
1100 1105 1110 

Ser Ser Val Ser Lys Gin Ser Ser Cys Glu Ser He Thr Asp Glu 
1115 1120 1125 

Phe Ser Arg Phe Met Val Asn Gin Met Glu Asn Glu Gly Arg Gly 
1130 1135 . 1140 

Phe Glu Leu Leu Leu Asp Tyr Tyr Ala Gly Lys Asn Ala Ser Ser 
1145 1150 1155 

He Leu Asn Ser Ala Met Gin Gin Ala Cys Arg Lys Ser Asp His 
1160 1165 1170 

Leu Ser Val Arg Pro Ser Cys Pro Ser Lys Gin Ser Ser Thr Glu 
1175 1180 1185 

Ser lie Thr Glu Glu Phe Tyr Arg Tyr Met Leu Arg Asp He Glu 
1190 1195 1200 

Arg Asp Ser Arg Glu Ser Ala Ser Ser Arg Arg Ser Ser Gin Asp 
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1205 1210 1215 

Trp Thr Ala Gly Leu Leu Ser Pro Ser Leu Arg Ser Pro Val Cys 
1220 1225 1230 

His Arg Gin Ser Ser Met Pro Asp Ser Arg Ser Pro Cys Ser Arg 
1235 1240 1245 

Leu Thr Val Asn val Pro lie Lys Ala Asn Ser Leu Asp Gly Phe 
1250 1255 1260 

Ala Gin Asn Cys Pro Gin Asp Phe Leu Ser Val Gin Pro Val Ser 
1265 1270 1275 

Ser Ala Ser Ser Ser Gly Leu Cys Lys Ser Asp Ser Cys Leu Tyr 
1280 1285 1290 

Arg Arg Gly Gly Thr Asp His lie Thr Asn Met Leu He His Glu 
1295 1300 1305 

Thr Trp Ala Ser Ser He Glu Ala Leu Met Arg Lys Asn Lys He 
1310 1315 1320 

He Val Asp Asp Ala Glu Glu Ala Asp Thr Glu Pro Val Ser Gly 
1325 1330 1335 

Gly Ser Pro Ser Gin Ala Glu Lys Cys Ala Asn Arg Leu Ala Ala 
1340 1345 1350 

Ser Arg Met Cys Ser Gly Pro Thr Leu Leu Val Gin Glu Ser Leu 
1355 1360 1365 

Asp Cys Pro Arg Lys Asp Ser Val Thr Glu Cys Lys Gin Pro Pro 
1370 1375 1380 

Val ser Ser Leu Ser Lys Thr Ala Ser Leu Thr Asn His Ser Pro 
1385 1390 1395 

Leu Asp Ser Lys Lys Glu Thr Ser Ser Cys Gin Asp Pro Val Pro 
1400 1405 1410 



He Asn His Lys Arg Arg Ser Leu Cys Ser Arg Glu Val Pro Leu 

1415 1420 1425 

He Gin He Glu Thr Asp Gin Arg Glu Ala Cys Ala Gly Glu Pro 

1430 1435 1440 



Glu Pro Phe Leu Ser Lys Ser Ser Leu Leu Glu Glu Ala Glu Gly 
1445 1450 1455 



His Ser Asn Asp Lys Asn He Pro Asp Val Val Arg Gly Gly Asp 
1460 1465 1470 



Thr Ala Val Ser Ala Cys Gin He His Ser Asp Ser Leu Asp Thr 
1475 1480 1485 



Arg Asp Val Pro Glu Ala Glu Ala Ser Thr Glu Ala Arg Ala Pro 
1490 1495 1500 
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Asp Glu Ala Pro Asn Pro Pro Ser Ser Ser Glu Glu Ser Thr Gly 
1505 1510 1515 

Ser Trp Thr Gin Leu Ala Asn Glu Glu Asp Asn Pro Asp Asp Thr 
1520 1525 1530 

Ser Ser Phe Leu Gin Leu Ser Glu Arg Ser Met Ser Asn Gly Asn 
1535 1540 1545 

Ser Ser Ala Thr Ser Ser Leu Gly lie Met Asp Leu Asp lie Tyr 
1550 1555 1560 

Gin Glu Ser Met Pro Ser Ser Pro Met lie Asn Glu Leu Val Glu 
1565 1570 1575 

p 

Glu Lys Lys lie Leu Lys Gly Gin Ser Glu Ser Thr Glu Ala Pro 
1580 1585 1590 

Ala Ser Gly Pro Pro Thr Gly Thr Ala Ser Pro Gin Arg Ser Leu 
1595 1600 1605 

Leu Val lie Asn Phe Asp Leu Glu Pro Glu Cys Pro Asp Ala Glu 
1610 1615 1620 

Leu Arg Ala Thr Leu Gin Trp lie Ala Ala Ser Glu Leu Gly He 
1625 1630 1635 

/ 

Pro Thr He Tyr Phe Lys Lys Ser Gin Glu Asn Arg He Glu Lys 
1640 1645 1650 

Phe Leu Asp Val Val Gin Leu Val His Arg Lys Ser Trp Lys Val 
1655 1660 1665 

Gly Asp He Phe His Ala Val Val Gin Tyr Cys Lys Met His Glu 
1670 1675 1680 

Glu Gin Lys Asp Gly Arg Leu Ser Leu Phe Asp Trp Leu Leu Glu 
1685 1690 1695 

Leu Gly 
1700 

<210> 40 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 40 

cgtcaggaat gaactggcac atacc 25 

<210> 41 
<211> 24 
<212> DNA 

<213> Homo sapiens; Homo sapiens 
<400> 41 

ctaagtggag agctgctgga ggat 24 

<210> 42 

<211> 1302 

<212> PRT 

<213> Homo sapiens 
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<400> 42 

Asp His 61u Asp Thr Arg Asn Ala Leu Pro Pro Arg Gin Asp Gly Glu 
IS 10 15 



Val Thr Thr Gly Lys Tyr Ala Thr Asn Zieu Ala Glu Ser Val Leu Gin 
20 25 30 



Asp Ala Phe He Arg Leu Ser Gin Ser Gin Ser Thr Leu Pro Gin Glu 
35 40 45 



Ser Ala Val Ser Val Ser Val Gly Ser Ser Leu Leu Pro Ser Cys Tyr 
50 55 60 



Ser Thr Lys Asp Thr Val Val Ser Arg Ser Trp Asn Glu Leu Pro Lys 
65 70 75 80 



He Val Val Val Gin Ser Pro Asp Gly Ser Asp Ala Ala Pro Gin Pro 
85 90 95 



Gly He Ser Ser Trp Pro Glu Met Glu Val Ser Val Glu Thr Ser Ser 
100 105 110 



He Leu Ser Gly Glu Asn Ser Ser Arg Gin Pro Gin Ser Ala Leu Glu 
115 120 125 



Val Ala Leu Ala Cys Ala Ala Thr Val He Gly Thr He Ser Ser Pro 
130 135 140 



Gin Ala Thr Glu Arg Leu Lys Met Glu Gin Val Val Ser Asn Phe Pro 
145 150 155 160 



Pro Gly Ser Ser Gly Ala Leu Gin Thr Gin Ala Pro Gin Gly Leu Lys 
165 170 175 



Glu Pro ser He Asn Glu Tyr Ser Phe Pro Ser Ala Leu Cys Gly Met 
180 185 190 



Thr Gin Val Ala Ser Ala Val Ala Val Cys Gly Leu Gly Glu Arg Glu 
195 200 205 



Glu Val Thr Cys Ser Val Ala Pro Ser Gly Ser Leu Pro Pro Ala Ala 
210 215 220 



Glu Ala Ser Glu Ala Met Pro Pro Leu Cys Gly Leu Ala Ser Met Glu 
225 230 235 240 

Leu Gly Lys Glu Ala He Ala Glu Gly Leu Leu Lys Glu Ala Ala Leu 
245 250 255 

Val Leu Thr Arg Pro Asn Thr Tyr Ser Ser He Gly Asp Phe Leu Asp 
260 265 270 



Ser Met Asn Arg Arg He Met Glu Thr Ala Ser Lys Ser Gin Thr Leu 
275 280 285 

Cys Ser Glu Asn Val Val Arg Asn Glu Leu Ala His Thr Leu Ser Asn 
290 295 300 



Val He Leu Arg His Ser He Asp Glu Val His His Lys Asn Met He 
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305 310 315 320 

lie Asp Pro Asn Asp Asn Arg His Ser Ser Glu lie Leu Asp Thr Leu 
325 330 335 

Met Glu Ser Thr Asn Gin Leu Leu Leu Asp Val lie Cys Phe Thr Phe 
340 345 350 

Lys Lys Met Ser His He Val Arg Leu Gly Glu Cys Pro Ala Val Leu 
355 360 365 

Ser Lys Glu Thr He Arg Arg Arg Glu Thr Glu Pro Ser Cys Gin Pro 
370 375 380 

Ser Asp Pro Gly Ala Ser Gin Ala Trp Thr Lys Ala Thr Glu Ser Ser 
385 390 395 400 

ser Ser Ser Pro Leu Ser Asn Ser His Asn Thr Ser Leu Val lie Asn 
405 410 415 

Asn Leu Val Asp Gly Met Tyr Ser Lys Gin Asp Lys Gly Gly Val Arg 
420 425 430 

Pro Gly Leu Phe Lys Asn Pro Thr Leu Gin Ser Gin Leu Ser Arg Ser 
435 440 445 

His Arg Val Pro Asp Ser Ser Thr Ala Thr Thr Ser Ser Lys Glu He 
450 455 460 

Tyr Leu Lys Gly He Ala Gly Glu Asp Thr Lys Ser Pro Gin His Ser 
465 470 475 480 

Glu Asn Glu Cys Arg Ala Ser Ser Glu Gly Gin Arg Ser Pro Thr Val 
485 490 495 

Ser Gin Ser Arg Ser Gly Ser Gin Glu Ala Glu Glu Ser He His Pro 
500 505 510 

Asn Thr Gin Glu Lys Tyr Asn Cys Ala Thr Ser Arg He Asn Glu Val 
515 520 525 



Gin Val Asn Leu Ser Leu Leu Gly Asp Asp Leu Leu Leu Pro Ala Gin 
530 535 540 



Ser Thr Leu Gin Thr Lys His Pro Asp He Tyr Cys He Thr Asp Phe 
545 550 555 560 



Ala Glu Glu Leu Ala Asp Thr Val Val Ser Met Ala Thr Glu He Ala 
565 570 575 



Ala He Cys Leu Asp Asn Ser Ser Gly Lys Gin Pro Trp Phe Cys Ala 
580 585 590 



Trp Lys Arg Gly Ser Glu Phe Leu Met Thr Pro Asn Val Pro Cys Arg 
595 600 605 



Ser Leu Lys Arg Lys Lys Glu Ser Gin Gly Ser Gly Thr Ala Val Arg 
610 615 620 
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Lys His Lys Pro Pro Arg Leu Ser Glu He Lys Arg Lys Thr Asp Glu 
625 630 635 640 

His Pro Glu Leu Lys Glu Lys Leu Met Asn Arg Val Val Asp Glu Ser 
645 650 655 

Met Asn Leu Glu Asp Val Pro Asp Ser Val Asn Leu Pbe Ala Asn Glu 
660 665 670 

Val Ala Ala Lys He Met Asn Leu Thr Glu Phe Ser Met Val Asp Gly 
675 680 685 

Met Trp Gin Ala Gin Gly Tyr Pro Arg Asn Arg Leu Leu Ser Gly Asp 
690 695 700 

Arg Trp Ser Arg Leu Lys Ala Ser Ser Cys Glu Ser He Pro Glu Glu 
70S 710 715 720 

Asp Ser Glu Ala Arg Ala Tyr Val Asn Ser Leu Gly Leu Met Ser Thr 
725 730 735 

Leu Ser Gin Pro Val Ser Arg Ala Ser Ser Val Ser Lys Gin Ser Ser 
740 745 750 

Cys Glu Ser He Thr Asp Glu Phe Ser Arg Phe Met Val Asn Gin Met 
755 760 765 

Glu Asn Glu Gly Arg Gly Phe Glu Leu Leu Leu Asp Tyr Tyr Ala Gly 
770 775 780 

Lys Asn Ala Ser Ser He Leu Asn Ser Ala Met Gin Gin Ala Cys Arg 
785 790 795 800 

Lys Ser Asp His Leu Ser Val Arg Pro Ser Cys Pro Ser Lys Gin Ser 
805 810 815 

Ser Thr Glu Ser He Thr Glu Glu Phe Tyr Arg Tyr Met Leu Arg Asp 
820 825 830 

He Glu Arg Asp Ser Arg Glu Ser Ala Ser Ser Arg Arg Ser Ser Gin 
835 840 845 

Asp Trp Thr Ala Gly Leu Leu Ser Pro Ser Leu Arg Ser Pro Val Cys 
850 855 860 

His Arg Gin Ser Ser Met Pro Asp Ser Arg Ser Pro Cys Ser Arg Leu 
865 870 875 880 

Thr Val Asn Val Pro He Lys Ala Asn Ser Leu Asp Gly Phe Ala Gin 
885 890 895 

Asn Cys Pro Gin Asp Phe Leu Ser Val Gin Pro Val Ser Ser Ala Ser 
900 905 910 

Ser Ser Gly Leu Cys Lys Ser Asp Ser Cys Leu Tyr Arg Arg Gly Gly 
915 920 925 

Thr Asp His He Thr Asn Met Leu He His Glu Thr Trp Ala Ser Ser 
930 935 940 
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lie Glu Ala Leu Met Arg Lys Asn Lys He He Val Asp Asp Ala Glu 
945 950 955 960 

Glu Ala Asp Thr Glu Pro Val Ser Gly Gly Ser Pro Ser Gin Ala Glu 
965 970 975 

Lys Cys Ala Asn Arg Leu Ala Ala Ser Arg Met Cys Ser Gly Pro Thr 
980 985 990 

Leu Leu Val Gin Glu Ser Leu Asp Cys Pro Arg Lys Asp Ser Val Thr 
995 1000 1005 

Glu Cys Lys Gin Pro Pro Val Ser Ser Leu Ser Lys Thr Ala Ser 
1010 1015 1020 

Leu Thr Asn His Ser Pro Leu Asp Ser Lys Lys Glu Thr Ser Ser 
1025 1030 1035 

Cys Gin Asp Pro Val Pro He Asn His Lys Arg Arg Ser Leu Cys 
1040 1045 1050 

Ser Arg Glu Val Pro Leu He Gin He Glu Thr Asp Gin Arg Glu 
1055 1060 1065 

Ala Cys Ala Gly Glu Pro Glu Pro Phe Leu Ser Lys Ser Ser Leu 
1070 1075 1080 

Leu Glu Glu Ala Glu Gly His Ser Asn Asp Lys Asn He Pro Asp 
1085 1090 1095 

Val Val Arg Gly Gly Asp Thr Ala Val Ser Ala Cys Gin He His 
1100 1105 1110 

Ser Asp Ser Leu Asp Thr Arg Asp Val Pro Glu Ala Glu Ala Ser 
1115 1120 1125 



Thr Glu Ala Arg Ala Pro Asp Glu Ala Pro Asn Pro Pro Ser Ser 
1130 1135 1140 



Ser Glu Glu Ser Thr Gly Ser Trp Thr Gin Leu Ala Asn Glu GXu 
1145 1150 1155 



Asp Asn Pro Asp Asp Thr Ser Ser Phe Leu Gin Leu Ser Glu Arg 
1160 1165 1170 



Ser Met Ser Glu Leu Val Glu Glu Lys Lys He Leu Lys Gly Gin 
1175 1180 1185 



Ser Glu Ser Thr Glu Ala Pro Ala Ser Gly Pro Pro Thr Gly Thr 
1190 1195 1200 



Ala Ser Pro Gin Arg Ser Leu Leu Val He Asn Phe Asp Leu Glu 
1205 1210 1215 



Fro Glu Cys Pro Asp Ala Glu Leu Arg Ala Thr Leu Gin Trp He 
1220 1225 1230 

Ala Ala Ser Glu Leu Gly He Pro Thr He Tyr Phe Lys Lys Ser 
1235 1240 1245 
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Gin Glu Asn Arg lie Glu Lys Phe Leu Asp Val Val Gin Leu Val 
1250 1255 1260 

His Arg Lys Ser Trp Lys Val Gly Asp He Phe His Ala Val Val 
1265 1270 1275 

Gin Tyr Cys Lys Met His Glu Glu Gin Lys Asp Gly Arg Leu Ser 
1280 1285 1290 

Leu Phe Asp Trp Leu Leu Glu Leu Gly 
1295 1300 

<210> 43 

<211> 3369 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (250) . . (2793) 

<223> 

<400> 43 

gctggatcaa gctgtgaacg tgatttgctg gaagctggtt gacgatgtgt cacactgtgt 60 

aagggaatcg catggagatg ggcattccga actgttaatg gggacatggg actccagttg 120 

tctctgatca cttgtgtgga ttttcctggc gtagaacgac agaagccgct agtaagtcgc 180 

caagacctac agcaggaatt ctgcaccaaa gggcataaaa tcttgttatt ttaatttgca 240 

tctgggaga atg tct gag caa gga gac ctg aat cag gca ata gca gag gaa 291 
Met Ser Glu Gin Gly Asp Leu Asn Gin Ala He Ala Glu Glu 
15 10 

gga ggg act gag cag gag acg gcc act cca gag aac ggc att gtt aaa 339 
Gly Gly Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly He Val Lys 
15 20 25 30 

tea gaa agt ctg gat gaa gag gag aaa ctg gaa ctg cag agg egg ctg 387 
Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu 
35 40 45 

gag get cag aat caa gaa aga aga aaa tec aag tea gga gca gga aaa 435 
Glu Ala Gin Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys 

50 55 60 

ggt aaa ctg act cgc agt ctt get gtc tgt gag gaa tct tct gcc aga 483 
Gly Lys Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg 
65 70 75 

cca gga ggt gaa agt ctt cag gat cag gaa tea att cat tta cag ctt 531 
Pro Gly Gly Glu Ser Leu Gin Asp Gin Glu Ser He His Leu Gin Leu 
80 85 90 

tec agt ttt tec age ctg caa gag gag gat aaa tct agg aaa gat gac 579 
Ser ser Phe Ser Ser Leu Gin Glu Glu Asp Lys Ser Arg Lys Asp Asp 
95 100 105 110 

tct gaa aga gaa aaa gaa aag gat aaa aac aaa gat aaa acc tct gaa 627 
Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu 
115 120 125 

aaa ccc aag ate aga atg tta tea aaa gat tgc age caa gaa tac acg 675 
Lys Pro Lys He Arg Met Leu Ser Lys Asp Cys Ser Gin Glu Tyr Thr 
130 135 140 

gat tct aca ggc ata gac tta cac gag ttt ctg att aac aca tta aag 723 
Asp Ser Thr Gly He Asp Leu His Glu Phe Leu He Asn Thr Leu Lys 
145 150 155 

aat aat tec agg gac agg atg ata ctt ttg aaa atg gag cag gaa att 771 
Asn Asn Ser Arg Asp Arg Met He Leu Leu Lys Met Glu Gin Glu He 
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160 165 170 

att gat ttc att get gac aac aat aat cat tat aaa aag ttc cct cag 819 
lie Asp Phe lie Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gin 
175 180 185 190 

atg tea teg tat cag agg atg ctt gtc cat cga gtg gea get tat ttt 867 
Met Ser Set Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe 
195 200 20S 

gga ttg gat cac aat gtg gat caa aca gga aaa tct gtt ate ate aac 915 
Gly Leu Asp His Asn Val Asp Gin Thr Gly Lys Ser Val lie lie Asn 
210 215 220 

aag aec age age ace aga at a cca gag caa agg ttt tgt gaa eat tta 963 
Lys Thr Ser Ser Thr Arg He Pro Glu Gin Arg Phe Cys Glu His Leu 

225 230 235 

aaa gat gaa aaa ggt gaa gaa tec cag aag egg ttt ate ttg aag ega 1011 
Lys Asp Glu Lys Gly Glu Glu Ser Gin Lys Arg Phe He Leu Lys Arg 
240 245 250 

gat aac tct agt att gat aaa gaa gac aat cag caa aac aga atg cat 1059 
Asp Asn ser Ser He Asp Lys Glu Asp Asn Gin Gin Asn Arg Met His 
255 260 265 270 

cca ttt aga gat gac aga cga agt aaa tea att gaa gag aga gaa gag 1107 
Pro Phe Arg Asp Asp Arg Arg Ser Lys Ser He Glu Glu Arg Glu Glu 
275 280 285 

gaa tat cag aga gtg agg gag aga ata ttt gea cac gat tea gtt tgc 1155 
Glu Tyr Gin Arg Val Arg Glu Arg He Phe Ala His Asp Ser Val Cys 
290 295 300 

tec cag gaa age ctt ttt gtg gaa aac agt agg etc ttg gaa gac agt 1203 
Ser Gin Glu Ser Leu Phe Val Glu Asn Ser Arg Leu Leu Glu Asp Ser 
305 310 315 

aac ata tgc aat gag acc tat aag aaa aga cag etc ttt egg ggc aac 1251 
Asn He Cys Asn Glu Thr Tyr Lys Lys Arg Gin Leu Phe Arg Gly Asn 
320 325 330 

aga gat ggc tea ggg aga aca tct ggg agt cga cag age age tea gaa 1299 
Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu 
335 340 345 350 

aat gaa etc aag tgg tct gac cac caa agg gcc tgg age age aca gac 1347 
Asn Glu Leu Lys Trp Ser Asp His Gin Arg Ala Trp Ser Ser Thr Asp 
355 360 365 

tec gac agt tec aac cgc aat eta aag ccc gcc atg acc aag acg gcg 1395 
Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala 
370 375 380 

agt ttt ggg ggc ate acg gtg ctg acc agg ggt gac age act tec agt 1443 
ser Phe Gly Gly He Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser 

385 390 395 

act agg agt acc ggg aag ctg tec aaa gea ggt tec gag tct tec age 1491 
Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser 
400 405 410 

agt gea ggc tec tea gga teg ctg tee cgc acc eat cca cct etc cag 1539 
Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gin 
415 420 425 430 

age aca ccc eta gtc tea ggt gtg gea get ggc tct cca ggc tgt gtg 1587 
Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val 
435 440 445 

cct tat cca gag aat gga ata ggg ggc cag gtt get ccc age age acc 1635 
Pro Tyr Pro Glu Asn Gly He Gly Gly Gin Val Ala Pro Ser Ser Thr 

450 455 460 

age tac ate etc ctt cca ctt gaa get gea aca ggc ate ccg cct gga 1683 
Ser Tyr He Leu Leu Pro Leu Glu Ala Ala Thr Gly He Pro Pro Gly 
465 470 475 

age ate ctt ctt aat cca cac aca ggc cag ccc ttt gtg aat ccc gat 1731 
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Ser He Leu Leu Asn Pro His Thr Gly Gin Pro Phe Val Asn Pro Asp 
480 485 490 

gga act cct gca ata tac aac cca ccc acc agt cag cag ccc ctg cga 1779 
Gly Thr Pro Ala lie Tyr Asn Pro Pro Thr Ser Gin Gin Pro Leu Arg 
495 500 505 510 

age gcc atg gtg ggg cag tec caa cag cag cca cea cag cag cag ccc 1827 
Ser Ala Met Val Gly Gin Ser Gin Gin Gin Pro Pro Gin Gin Gin Pro 
515 520 525 

tee ccg cag ccc caa cag cag gtc cag cca ccg cag cca cag atg gca 1875 
Ser Pro Gin Pro Gin Gin Gin Val Gin Pro Pro Gin Pro Gin Met Ala 
530 535 540 

ggc cct ctg gtc act cag tct gtc cag ggg ctg cag get tec tec cag 1923 
Gly Pro Leu Val Thr Gin Ser Val Gin Gly Leu Gin Ala Ser Ser Gin 
545 550 555 

tea gtg caa tat cca gca gtc tct ttt cct ccc cag cac etc eta cct 1971 
Ser Val Gin Tyr Pro Ala Val Ser Phe Pro Pro Gin His Leu Leu Pro 
560 565 570 

gtg tct cea aeg cag cac ttt ccc atg aga gat gat gtg gca aca cag 2019 
Val Ser Pro Thr Gin His Phe Pro Met Arg Asp Asp Val Ala Thr Gin 
575 580 585 590 

ttt ggc cag atg acc ctg age egg cag tec teg ggg gag act cct gaa 2067 
Phe Gly Gin Met Thr Leu Ser Arg Gin Ser Ser Gly Glu Thr Pro Glu 
595 600 605 

ccc cca tea ggt cct gtc tac cea tec tec ctt atg cca cag ccg gee 2115 
Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gin Pro Ala 
610 615 620 

cag cag ccc age tat gta ate gee tct aca ggc cag cag ctt cct aca . 2163 
Gin Gin Pro Ser Tyr Val He Ala Ser Thr Gly Gin Gin Leu Pro Thr 
625 630 635 

gga gga tte tea ggc tct ggc cct ccc ate tec cag cag gtc etc cag 2211 
Gly Gly" Phe Ser Gly Ser Gly Pro Pro He Ser Gin Gin val Leu Gin 
640 645 650 

ccc cct ccc tea cca cag gga ttt gtg caa cag cct ccg cct gca cag 2259 
Pro Pro Pro Ser Pro Gin Gly Phe Val Gin Gin Pro Pro Pro Ala Gin 
655 660 665 670 

atg cct gta tat tat tac cca tct ggt cag tac cct ace tea acc acg 2307 
Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gin Tyr Pro Thr Ser Thr Thr 

675 680 685 

caa cag tac egg ccc atg gee ccg gtt cag tac aac get cag agg agt 2355 
Gin Gin Tyr Arg Pro Met Ala Pro Val Gin Tyr Asn Ala Gin Arg Ser 
690 695 700 

caa cag atg cca cag gca gca cag caa gca ggt tac cag cca gtc ttg 2403 
Gin Gin Met Pro Gin Ala Ala Gin Gin Ala Gly Tyr Gin Pro Val Leu 
705 710 715 

tct ggt caa cag gga ttc caa ggc eta ata gga gtg cag cag cca cct 2451 
Ser Gly Gin Gin Gly Phe Gin Gly Leu He Gly Val Gin Gin Pro Pro 
720 725 730 

cag agt cag aac gtg ata aat aac caa caa gga act ccg gtg caa age 2499 
Gin Ser Gin Asn Val He Asn Asn Gin Gin Gly Thr Pro Val Gin Ser 
735 740 745 750 

gtg atg gtt tec tac cca aca atg tct tct tat cag gtg cca atg acc 2547 
Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gin Val Pro Met Thr 
755 760 765 

cag ggt tct caa gga ctg ccc cag cag tea tac caa cag cca ate atg 2595 
Gin Gly Ser Gin Gly Leu Pro Gin Gin Ser Tyr Gin Gin Pro He Met 
770 775 780 

eta cct aac cag gca ggt caa ggg tea etc cca gee act gga atg cct 2643 
Leu Pro Asn Gin Ala Gly Gin Gly Ser Leu Pro Ala Thr Gly Met Pro 
785 790 795 
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gtt tac tgt aat gtc aca ccg ccc acc cct cag aac aac ctt agg ctg 2691 
Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gin Asn Asn Leu Arg Leu 
800 805 810 

att ggc cca cac tgc ccc tec age act gtc cea gtg atg tea get age 2739 
lie Gly Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser 
815 820 825 830 

tgc aga aca aac tgt gca agt atg age aat get ggt tgg cag gtc aaa 2787 
Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gin Val Lys 
835 840 845 

ttc tga gagctctgge tgtggtacat ttcttcagat atttctcatg gcctttgatg 2843 
Phe 



gaagaggaac 


aaggtgggaa 


aactggctga 


ggacttaagt attcactcaa 


cactcaaatg 


2903 


attgctgctg 


gtattctgta 


aaaaataaac 


aaagactaat atacacgtta 


gctggttaat 


2963 


ggtgcatatt 


tctgtcatgt 


ctgctaggta 


tgcctttata gcttagctag tgacatgaat 


3023 


teateaaggt 


aagattttct 


cctaccactg 


aataccactg tgtagattat 


aatatcccta 


3083 


atttggatta 


gtt t tgt act 


ttgtgttgag 


tttgtgatgc taaaagtatt 


taaaaattat 


3143 


atactaaatc 


acattgtacc 


aaagctgtaa 


tggaaaagca aagaagaatt 


gatgaattga 


3203 


aggaataatt 


tatatacatt 


atagagtttt 


cttttttaat ggatatatac 


tgtattgtag 


3263 


tgtttaatca 


aaataaaact 


atttgacctt 


atggaggaag gtcatgtttt taccaccaaa 


3323 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa aaaaaa 




3369 



<210> 44 

<211> 847 

<212> PRT 

<213> Homo sapiens 

<400> 44 

Met Ser Glu Gin Gly Asp Leu Asn Gin Ala lie Ala Glu Glu Gly Gly 
15 10 15 



Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly lie Val Lys Ser Glu 
20 25 30 



Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Glu Ala 
35 40 45 



Gin Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 
50 55 60 



Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 
65 70 75 80 



Gly Glu Ser Leu Gin Asp Gin Glu Ser lie His Leu Gin Leu Ser Ser 
85 90 95 



Phe Ser Ser Leu Gin Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 
100 105 110 



Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 
115 120 125 

Lys lie Arg Met Leu Ser Lys Asp Cys Ser Gin Glu Tyr Thr Asp Ser 
130 135 140 



Thr Gly lie Asp Leu His Glu Phe Leu lie Asn Thr Leu Lys Asn Asn 
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145 150 155 160 

Ser Arg Asp Arg Met lie Leu lieu Lys Met Glu Gin Glu lie lie Asp 
165 170 175 



Pbe lie Ala Asp Asn Asn Asn His Tyr I>ys Lys Fhe Pro Gin Met Ser 
180 . 185 190 

Ser Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 
195 200 205 



Asp His Asn Val Asp Gin Thr Gly Lys Ser Val lie lie Asn Lys Thr 
210 215 220 



Ser Ser Thr Arg lie Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp 
225 230 235 240 



Glu Lys Gly Glu Glu Ser Gin Lys Arg Phe He Leu Lys Arg Asp Asn 
245 250 255 



Ser Ser He Asp Lys Glu Asp Asn Gin Gin Asn Arg Met His Pro Phe 
260 265 270 



Arg Asp Asp Arg Arg Ser Lys Ser lie Glu Glu Arg Glu Glu Glu Tyr 
275 280 285 



Gin Arg Val Arg Glu Arg He Phe Ala His Asp Ser Val Cys Ser Gin 
290 295 300 



Glu Ser Leu Phe Val Glu Asn Ser Arg Leu Leu Glu Asp Ser Asn He 
305 310 315 320 



Cys Asn Glu Thr Tyr Lys Lys Arg Gin Leu Phe Arg Gly Asn Arg Asp 
325 330 335 



Gly Ser Gly Arg Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu Asn Glu 
340 345 350 



Leu Lys Trp Ser Asp His Gin Arg Ala Trp Ser Ser Thr Asp Ser Asp 
355 360 365 



Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe 
370 375 380 



Gly Gly He Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg 
385 390 395 400 



Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala 
405 410 415 

Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gin Ser Thr 
420 425 430 

Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr 
435 440 445 

Pro Glu Asn Gly He Gly Gly Gin Val Ala Pro Ser Ser Thr Ser Tyr 
450 455 4 60 
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He Leu Leu Pro Leu Glu Ala Ala Thr Gly He Pro Pro Gly Ser He 
465 470 475 480 

Leu Leu Asn Pro His Thr Gly Gin Pro Phe Val Asn Pro Asp Gly Thr 
4B5 490 495 

Pro Ala He Tyr Asn Pro Pro Thr Ser Gin Gin Pro Leu Arg Ser Ala 
500 505 510 

Met Val Gly Gin Ser Gin Gin Gin Pro Pro Gin Gin Gin Pro Ser Pro 
515 520 525 

Gin Pro Gin Gin Gin Val Gin Pro Pro Gin Pro Gin Met Ala Gly Pro 
530 535 540 

Leu Val Thr Gin Ser Val Gin Gly Leu Gin Ala Ser Ser Gin Ser Val 
545 550 555 560 

Gin Tyr Pro Ala Val Ser Phe Pro Pro Gin His Leu Leu Pro val Ser 
565 570 575 

Pro Thr Gin His Phe Pro Met Arg Asp Asp Val Ala Thr Gin Phe Gly 
580 585 590 

Gin Met Thr Leu Ser Arg Gin Ser Ser Gly Glu Thr Pro Glu Pro Pro 
595 600 605 

Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gin Pro Ala Gin Gin 
610 615 620 

Pro Ser Tyr Val He Ala Ser Thr Gly Gin Gin Leu Pro Thr Gly Gly 
625 630 635 640 

Phe Ser Gly Ser Gly Pro Pro He Ser Gin Gin Val Leu Gin Pro Pro 
645 650 655 



Pro Ser Pro Gin Gly Phe Val Gin Gin Pro Pro Pro Ala Gin Met Pro 

660 665 670 

Val Tyr Tyr Tyr Pro Ser Gly Gin Tyr Pro Thr Ser Thr Thr Gin Gin 

675 680 685 



Tyr Arg Pro Met Ala Pro Val Gin Tyr Asn Ala Gin Arg Ser Gin Gin 
690 695 700 

Met Pro Gin Ala Ala Gin Gin Ala Gly Tyr Gin Pro Val Leu Ser Gly 
705 710 715 720 



Gin Gin Gly Phe Gin Gly Leu He Gly Val Gin Gin Pro Pro Gin Ser 
725 730 735 



Gin Asn Val He Asn Asn Gin Gin Gly Thr Pro Val Gin Ser Val Met 
740 745 750 

Val Ser Tyr Pro Thr Met Ser Ser Tyr Gin Val Pro Met Thr Gin Gly 
755 760 765 

Ser Gin Gly Leu Pro Gin Gin Ser Tyr Gin Gin Pro He Met Leu Pro 
770 775 780 



Page 63 



wo 03/085095 



PCTAJS03/09921 



16D 100 PCT.ST25 

Asn Gin Ala Gly Gin Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr 
785 790 795 800 

Cys Asn Val Thr Pro Pro Thr Pro Gin Asn Asn Leu Arg Leu He Gly 
805 810 815 

Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg 
820 825 830 

Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gin Val Lys Phe 
835 840 845 

<210> 45 

<211> 3374 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (329) (2812) 

<223> 

<400> 45 

gtctattttt aatgctattt aatgaaggag cgagcgcctc actcagcaat aaaagaagca 60 

tgagggaaga cagagcagtg catggttatg gatactggac aaggatattt ggaaaggttg 120 

acgatgtgtc acactgtgta agggaatcgc atggagatgg gcattccgaa ctgttaatgg 180 

ggacatggga ctccagttgt ctctgatcac ttgtgtggat tttcctggcg tagaacgaca 240 

gaagccgcta gtaagtcgcc aagacctaca gcaggaattc tgcaccaaag ggcataaaat 300 

cttgttattt taatttgcat ctgggaga atg tct gag caa gga gac ctg aat 352 

Met Ser Glu Gin Gly Asp Leu Asn 
1 5 

cag gca ata gca gag gaa gga ggg act gag cag gag acg gcc act cca 400 
Gin Ala He Ala Glu Glu Gly Gly Thr Glu Gin Glu Thr Ala Thr Pro 

10 15 20 

gag aac ggc att gtt aaa tea gaa agt ctg gat gaa gag gag aaa ctg 448 
Glu Asn Gly He Val Lys Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu 
25 30 35 40 

gaa ctg cag agg egg ctg gag get cag aat caa gaa aga aga aaa tec 496 
Glu Leu Gin Arg Arg Leu Glu Ala Gin Asn Gin Glu Arg Arg Lys Ser 
45 50 55 

aag tea gga gca gga aaa ggt aaa ctg act cgc agt ctt get gtc tgt 544 
Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu Ala Val Cys 
60 65 70 

gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag gat cag gaa 592 
Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gin Asp Gin Glu 
75 80 85 

tea att cat tta cag ctt tec agt ttt tec age ctg caa gag gag gat 640 
Ser He His Leu Gin Leu Ser Ser Phe Ser Ser Leu Gin Glu Glu Asp 
90 95 100 

aaa tct agg aaa gat gac tct gaa aga gaa aaa gaa aag gat aaa aac 688 
Lys Ser Arg Lys Asp Asp Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn 
105 110 115 120 

aaa gat aaa ace tct gaa aaa ccc aag ate aga atg tta tea aaa gat 736 
Lys Asp Lys Thr Ser Glu Lys Pro Lys He Arg Met Leu Ser Lys Asp 
125 130 135 

tgc age caa gaa tac acg gat tct aca ggc ata gac tta cac gag ttt 784 
Cys Ser Gin Glu Tyr Thr Asp Ser Thr Gly He Asp Leu His Glu Phe 
140 145 150 

ctg att aac aca tta aag aat aat tee agg gac agg atg ata ctt ttg 832 
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Leu lie Asn Thr Leu Lys Asn Asn Ser Arg Asp Arg Met lie Leu Leu 
155 160 165 

aaa atg gag cag gaa att att gat ttc att get gac aac aat aat cat 880 
Lys Met Glu Gin Glu lie He Asp Phe He Ala Asp Asn Asn Asn His 
170 175 180 

tat aaa aag ttc cct cag atg tea teg tat cag agg atg ctt gtc cat 928 
Tyr Lys Lys Phe Pro Gin Met Ser Ser Tyr Gin Arg Met Leu Val His 
185 190 195 200 

cga gtg gea get tat ttt gga ttg gat cac aat gtg gat caa aca gga 976 
Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val Asp Gin Thr Gly 
205 210 215 

aaa tct gtt ate ate aac aag ace age age ace aga ata eca gag caa 1024 
Lys Ser Val He He Asn Lys Thr Ser Ser Thr Arg He Pro Glu Gin 
220 225 230 

agg ttt tgt gaa cat tta aaa gat gaa aaa ggt gaa gaa tec cag aag 1072 
Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu Glu Ser Gin Lys 
235 240 245 

egg ttt ate ttg aag cga gat aac tct agt att gat aaa gaa gac aat 1120 
Arg Phe He Leu Lys Arg Asp Asn Ser Ser He Asp Lys Glu Asp Asn 
250 255 260 

cag caa aac aga atg cat cca ttt aga gat gac aga cga agt aaa tea 1168 
Gin Gin Asn Arg Met His Pro Phe Arg Asp Asp Arg Arg Ser Lys Ser 
265 270 275 280 

att gaa gag aga gaa gag gaa tat cag aga gtg agg gag. aga ata ttt 1216 
He Glu Glu Arg Glu Glu Glu Tyr Gin Arg Val Arg Glu Arg He Phe 
285 290 295 

gea cac gat tea gtt tgc tec cag gaa age ctt ttt gtg gaa aac agg 1264 
Ala His Asp Ser Val Cys Ser Gin Glu Ser Leu Phe Val Glu Asn Arg 
300 305 310 

ggc aac aga gat ggc tea ggg aga aca tct ggg agt cga cag age age 1312 
Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gin Ser Ser 
315 320 325 

tea gaa aat gaa etc aag tgg tct gac cac caa agg gcc tgg age age 1360 
Ser Glu Asn Glu Leu Lys Trp Ser Asp His Gin Arg Ala Trp Ser Ser 
330 335 340 

aca gac tec gac agt tec aac cgc aat eta aag ecc gcc atg ace aag 1408 
Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys 
345 350 355 360 

aeg gcg agt ttt ggg ggc ate acg gtg ctg ace agg ggt gac age act 1456 
Thr Ala Ser Phe Gly Gly He Thr Val Leu Thr Arg Gly Asp Ser Thr 
365 370 375 

tec agt act agg agt acc ggg aag ctg tec aaa gea ggt tec gag tct 1504 
ser Ser Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser 
380 385 390 

tec age agt gea ggc tec tea gga teg ctg tec cgc acc cat cca cct 1552 
Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro 
395 400 405 

etc cag age aca eee eta gtc tea ggt gtg gea get ggc tct cca ggc 1600 
Leu Gin Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly 
410 415 420 

tgt gtg cct tat cca gag aat gga ata ggg ggc cag gtt get ecc age 1648 
Cys Val Pro Tyr Pro Glu Asn Gly He Gly Gly Gin Val Ala Pro Ser 
425 430 435 440 

age acc age tac ate etc ctt cca ctt gaa get gea aca ggc ate ccg 1696 
Ser Thr Ser Tyr He Leu Leu Pro Leu Glu Ala Ala Thr Gly He Pro 
445 450 455 

cct gga age ate ctt ctt aat cca cac aca ggc cag ecc ttt gtg aat 1744 
Pro Gly Ser He Leu Leu Asn Pro His Thr Gly Gin Pro Phe Val Asn 
460 465 470 
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ccc gat gga act cct gca ata tac aac cca ccc acc agt cag cag ccc 1792 
Pro Asp Gly Thr Pro Ala lie Tyr Asn Pro Pro Thr Ser Gin Gin Pro 

475 480 485 

ctg cga age gcc atg gtg ggg cag tec eaa cag cag ccg cca cag cag 1840 
Leu Arg Ser Ala Met Val Gly Gin Ser Gin Gin Gin Pro Pro Gin Gin 
490 495 500 

cag ccc tec ccg cag ccc caa cag cag gtc cag cca ccg cag cca cag 18 68 
Gin Pro Ser Pro Gin Pro Gin Gin Gin Val Gin Pro Pro Gin Pro Gin 
505 510 515 520 

atg gca ggc cct ctg gtc act cag tct gtc cag ggg ctg cag get tec 1936 
Met Ala Gly Pro Leu Val Thr Gin Ser Val Gin Gly Leu Gin Ala Ser 
525 530 535 

tec cag tea gtg caa tat ccg gca gtc tct ttt cct ccc cag cac etc 1984 
Ser Gin Ser Val Gin Tyr Pro Ala Val Ser Phe Pro Pro Gin His Leu 
540 545 550 

eta cct gtg tct cca acg cag cac ttt ccc atg aga gat gat gtg gca 2032 
Leu Pro Val Ser Pro Thr Gin His Phe Pro Met Arg Asp Asp Val Ala 
555 560 565 

aca cag ttt ggc cag atg acc ctg age egg cag tec teg ggg gag act 2080 
Thr Gin Phe Gly Gin Met Thr Leu Ser Arg Gin Ser Ser Gly Glu Thr 
570 575 580 

cct gaa ccc cca tea ggt cct gtc tac cca tec tec ctt atg cca cag 2128 
Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gin 
585 590 595 600 

ccg gcc cag cag ccc age tat gta ate gcc tct aca ggc cag cag ctt 2176 
Pro Ala Gin Gin Pro Ser Tyr Val lie Ala Ser Thr Gly Gin Gin Leu 
605 610 615 

cct aca gga gga ttc tea ggc tct ggc cct ccc ate tec cag cag gtc 2224 
Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro Pro lie Ser Gin Gin Val 
620 625 630 

etc cag ccc cct ccc tea cca cag gga ttt gtg caa cag cct ccg cct 2272 
Leu Gin Pro Pro Pro Ser Pro Gin Gly Phe Val Gin Gin Pro Pro Pro 
635 640 645 

gca cag atg cct gta tat tat tac cca tct ggt cag tac cct ace tea 2320 
Ala Gin Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gin Tyr Pro Thr Ser 
650 655 660 

acc acg caa cag tac egg ccc atg gcc ccg gtt cag tac aac get cag 2368 
Thr Thr Gin Gin Tyr Arg Pro Met Ala Pro Val Gin Tyr Asn Ala Gin 
665 670 675 680 

agg agt caa cag atg cca cag gca gca cag caa gca ggt tac cag cca 2416 
Arg Ser Gin Gin Met Pro Gin Ala Ala Gin Gin Ala Gly Tyr Gin Pro 
685 690 695 

gtc ttg tct ggt eaa cag gga ttc caa ggc eta ata gga gtg cag cag 2464 
Val Leu Ser Gly Gin Gin Gly Phe Gin Gly Leu He Gly Val Gin Gin 
700 705 710 

cca cct cag agt cag aac gtg ata aat aac caa caa gga act ccg gtg 2512 
Pro Pro Gin Ser Gin Asn Val He Asn Asn Gin Gin Gly Thr Pro Val 
715 720 725 

caa age gtg atg gtt tec tac cca aca atg tct tct tat cag gtg cca 2560 
Gin Ser Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gin Val Pro 
730 735 740 

atg acc cag ggt tct caa gga ctg ccc cag cag tea tac caa cag cca 2608 
Met Thr Gin Gly Ser Gin Gly Leu Pro Gin Gin Ser Tyr Gin Gin Pro 
745 750 755 760 

ate atg eta cct aac cag gca ggt eaa ggg tea etc cca gcc act gga 2656 
He Met Leu Pro Asn Gin Ala Gly Gin Gly Ser Leu Pro Ala Thr Gly 
765 770 775 

atg cct gtt tac tgt aat gtc aca ccg ccc acc cct cag aac aac ctt 2704 
Met Pro Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gin Asn Asn Leu 
780 785 790 
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agg ctg att ggc cca cac tgc ccc tec age act gtc cca gtg atg tea 2752 
Arg Leu lie Gly Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser 
795 800 805 

get age tgc aga aea aac tgt gca agt atg age aat get ggt tgg eag 2800 
Ala Ser Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gin 
810 815 820 



gte aaa tte tga gage tct ggc tgtggtacat ttctteagat atttetcatg 

Val Lys Phe 

825 




geetttgatg 


gaagaggaae 


aaggtgggaa 


aaetggctga ggacttaagt 


atteaeteaa 


2912 


cactcaaatg 


attgctgetg 


gtattctgta 


aaaaataaac aaagactaat 


ataeacgtta 


2972 


getggttaat 


ggtgcatatt 


tctgtcatgt 


ctgetaggta tgcetttata gettagctag 


3032 


tgaeatgaat 


teateaaggt 


aagattttct 


cetaccactg aataecaetg 


tgtagattat 


3092 


aatatcccta 


atttggatta 


gttttgtact 


ttgtgttgag tttgtgatge 


taaaagtatt 


3152 


taaaaattat 


atactaaate 


aeattgtaec 


aaagctgtaa tggaaaagca 


aagaagaatt 


3212 


gatgaattga 


aggaataatt 


tatatacatt 


atagagtttt cttttttaat 


ggatatatae 


3272 


tgtattgtag 


tgtttaatca 


aaataaaaet 


atttgacctt atggaggaag 


gtcatgtttt 


3332 


taaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa aa 




3374 



<210> 46 

<211> 827 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Met ser Glu Gin Gly Asp Leu Asn Gin Ala lie Ala Glu Glu Gly Gly 
15 10 15 

Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly lie Val Lys Ser Glu 
20 25 30 



Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Glu Ala 
35 40 45 



Gin Asn Gin Glu Arg TVrg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 
50 55 60 



Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 
65 70 75 80 



Gly Glu Ser Leu Gin Asp Gin Glu Ser lie His Leu Gin Leu Ser Ser 
85 90 95 



Phe Ser Ser Leu Gin Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 
100 105 110 



Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 
115 120 125 



Lys lie Arg Met Leu Ser Lys Asp Cys Ser Gin Glu Tyr Thr Asp Ser 

130 135 140 

Thr Gly lie Asp Leu His Glu Phe Leu He Asn Thr Leu Lys Asn Asn 

145 150 155 160 
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Ser Arg Asp Arg Met lie Leu Leu Lys Met Glu Gin Glu lie He Asp 
165 170 175 

Phe He Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gin Met Ser 
180 185 190 

Ser Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 
195 200 205 

Asp His Asn Val Asp Gin Thr Gly Lys Ser Val He He Asn Lys Thr 
210 215 220 

Ser Ser Thr Arg He Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp 
225 230 235 240 

Glu Lys Gly Glu Glu Ser Gin Lys Arg Phe He Leu Lys Arg Asp Asn 
245 250 255 

Ser Ser He Asp Lys Glu Asp Asn Gin Gin Asn Arg Met His Pro Phe 
260 265 270 

Arg Asp Asp Arg Arg Ser Lys Ser He Glu Glu Arg Glu Glu Glu Tyr 
275 280 285 

Gin Arg Val Arg Glu Arg He Phe Ala His Asp Ser Val Cys Ser Gin 
290 295 300 

Glu Ser Leu Phe Val Glu Asn Arg Gly Asn Arg Asp Gly Ser Gly Arg 
305 310 315 320 

Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser 
325 330 335 

Asp His Gin Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg 
340 345 350 

Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly He Thr 
355 360 365 



Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys 
370 375 380 

Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly 
385 390 395 400 



Ser Leu Ser Arg Thr His Pro Pro Leu Gin Ser Thr Pro Leu Val Ser 
405 410 415 



Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly 
420 425 430 

He Gly Gly Gin Val Ala Pro Ser Ser Thr Ser Tyr He Leu Leu Pro 
435 440 445 



Leu Glu Ala Ala Thr Gly He Pro Pro Gly Ser He Leu Leu Asn Pro 
450 455 460 



His Thr Gly Gin Pro Phe Val Asn Pro Asp Gly Thr Pro Ala He Tyr 
465 470 475 480 
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Asn Pro Pro Thr Ser Gin Gin Pro Leu Arg Ser Ala Met Val Gly Gin 
485 490 495 

Ser Gin Gin Gin Pro Pro Gin Gin Gin Pro Ser Pro Gin Pro Gin Gin 
500 505 510 

Gin Val Gin Pro Pro Gin Pro Gin Met Ala Gly Pro Leu Val Thr Gin 
515 520 525 

Ser Val Gin Gly Leu Gin Ala Ser Ser Gin Ser Val Gin Tyr Pro Ala 
530 535 540 

Val Ser Phe Pro Pro Gin His Leu Leu Pro Val Ser Pro Thr Gin His 
545 550 555 560 

Phe Pro Met Arg Asp Asp Val Ala Thr Gin Phe Gly Gin Met Thr Leu 
565 «. 570 575 

Ser Arg Gin Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val 
580 585 590 

Tyr Pro Ser Ser Leu Met Pro Gin Pro Ala Gin Gin Pro Ser Tyr Val 
595 600 605 

lie Ala Ser Thr Gly Gin Gin Leu Pro Thr Gly Gly Phe Ser Gly Ser 
610 615 620 

Gly Pro Pro lie Ser Gin Gin Val Leu Gin Pro Pro Pro Ser Pro Gin 
625 630 635 640 



Gly Phe Val Gin Gin Pro Pro Pro Ala Gin Met Pro Val Tyr Tyr Tyr 
645 650 655 



Pro Ser Gly Gin Tyr Pro Thr Ser Thr Thr Gin Gin Tyr Arg Pro Met 

660 665 670 

Ala Pro Val Gin Tyr Asn Ala Gin Arg Ser Gin Gin Met Pro Gin Ala 

675 680 685 



Ala Gin Gin Ala Gly Tyr Gin Pro Val Leu Ser Gly Gin Gin Gly Phe 
690 695 700 



Gin Gly Leu lie Gly Val Gin Gin Pro Pro Gin Ser Gin Asn Val lie 
705 710 715 720 



Asn Asn Gin Gin Gly Thr Pro Val Gin Ser Val Met Val Ser Tyr Pro 
725 730 735 



Thr Met Ser Ser Tyr Gin Val Pro Met Thr Gin Gly Ser Gin Gly Leu 
740 745 750 



Pro Gin Gin Ser Tyr Gin Gin Pro lie Met Leu Pro Asn Gin Ala Gly 

755 760 765 

Gin Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr 
770 775 780 

Pro Pro Thr Pro Gin Asn Asn Leu Arg Leu lie Gly Pro His Cys Pro 
785 790 795 800 
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Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg Tbr Asn Cys Ala 
80S 810 815 

Ser Met Ser T^n Ala Gly Trp Gin Val Lys Phe 
820 825 

<210> 47 

<211> 3332 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (329) . . (2770) 

<223> 

<400> 47 

gtctattttt aatgctattt aatgaaggag cgagcgcctc actcagcaat aaaagaagca 60 

tgagggaaga cagagcagtg catggttatg gatactggac aaggatattt ggaaaggttg 120 

acgatgtgtc acactgtgta agggaatcgc atggagatgg gcattccgaa ctgttaatgg 180 

ggacatggga ctccagttgt ctctgatcac ttgtgtggat tttcctggcg tagaacgaca 240 

gaagccgcta gtaagtcgcc aagacctaca gcaggaattc tgcaccaaag ggcataaaat 300 

cttgttattt taatttgcat ctgggaga atg tct gag caa gga gac ctg aat 352 

Met Ser Glu Glxi Gly Asp Leu Asn 
1 5 

cag gca ata gca gag gaa gga ggg act gag cag gag acg gcc act cca 400 
Gin Ala lie Ala Glu Glu Gly Gly Thr Glu Gin Glu Thr Ala Thr Pro 
10 15 20 

gag aac ggc att gtt aaa tea gaa agt ctg gat gaa gag gag aaa ctg 448 
Glu Asn Gly lie Val Lys Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu 
25 30 35 40 

gaa ctg cag agg egg ctg gag get cag aat caa gaa aga aga aaa tec 496 
Glu Leu Gin Arg Arg Leu Glu Ala Gin Asn Gin Glu Arg Arg Lys Ser 
45 50 55 

aag tea gga gca gga aaa ggt aaa ctg act cgc agt ctt get gte tgt 544 
Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu Ala Val Cys 
60 65 70 

gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag gat cag gaa 592 
Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gin Asp Gin Glu 
75 80 85 

tea att cat tta cag ctt tec agt ttt tec age ctg caa gag gag gat 640 
Ser lie His Leu Gin Leu Ser Ser Phe Ser Ser Leu Gin Glu Glu Asp 
90 95 100 

aaa tct agg aaa gat gac tct gaa aga gaa aaa gaa aag gat aaa aac 688 
Lys Ser Arg Lys Asp Asp Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn 
105 110 115 120 

aaa gat aaa acc tct gaa aaa ccc aag ate aga atg tta tea aaa gat 736 
Lys Asp Lys Thr Ser Glu Lys Pro Lys He Arg Met Leu Ser Lys Asp 
125 130 135 

tgc age caa gaa tac acg gat tct aca ggc ata gac tta cac gag ttt 784 
Cys Ser Gin Glu Tyr Thr Asp Ser Thr Gly He Asp Leu His Glu Phe 
140 145 150 

ctg att aac aca tta aag aat aat tec agg gac agg atg ata ctt ttg 832 
Leu He Asn Thr Leu Lys Asn Asn Ser Arg Asp Arg Met He Leu Leu 
155 160 165 

aaa atg gag cag gaa att att gat ttc att get gac aac aat aat cat 880 
Lys Met Glu Gin Glu He He Asp Phe He Ala Asp Asn Asn Asn His 
170 175 180 
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tat aaa aag ttc cct cag atg tea teg tat cag agg atg ctt gtc cat 928 
Tyr Lys Lys Phe Pro Gin Met Ser Ser Tyr Gin Arg Met Leu Val His 
185 190 195 200 



cga gtg gca get tat ttt gga ttg gat cac aat gtg gat caa aca gga 
Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val Asp Gin Thr Gly 
205 210 215 



age age aca gac tec gac agt tec aac egc aat eta aag ccc gcc atg 
Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met 
330 335 340 



976 



aaa tet gtt ate ate aac aag acc age age ace aga ata cca gag caa 1024 
Lys ser Val He He Asn Lys Thr Ser Ser Thr Arg He Pro Glu Gin 
220 225 230 

agg ttt tgt gaa cat tta aaa gat gaa aaa ggt gaa gaa tec cag aag 1072 
Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu Glu Ser Gin Lys 
235 240 245 

egg ttt ate ttg aag cga gat aac tet agt att gat aaa gaa gac aat 1120 
Arg Phe He Leu Lys Arg Asp Asn Ser Ser He Asp Lys Glu Asp Asn 

250 255 260 

cag tea gtt tgc tec cag gaa age ctt ttt gtg gaa aac agt agg etc 1168 
Gin Ser Val Cys Ser Gin Glu Ser Leu Phe val Glu Asn Ser Arg Leu 
265 270 275 280 

ttg gaa gac agt aac ata tgc aat gag acc tat aag aaa aga cag etc 1216 
Leu Glu Asp Ser Asn He Cys Asn Glu Thr Tyr Lys Lys Arg Gin Leu 
285 290 295 

ttt egg ggc aac aga gat ggc tea ggg aga aca tet ggg agt cga cag 1264 
Phe Arg Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gin 
300 305 310 

age age tea gaa aat gaa etc aag tgg tet gac cac caa agg gcc tgg 1312 
Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser Asp His Gin Arg Ala Trp 
315 320 325 



1360 



acc aag acg geg agt ttt ggg ggc ate aeg gtg etg acc agg ggt gac 1408 
Thr Lys Thr Ala Ser Phe Gly Gly He Thr Val Leu Thr Arg Gly Asp 
345 350 355 360 

age act tec agt act agg agt acc ggg aag ctg tec aaa gca ggt tec 1456 
Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser 
365 370 375 

gag tet tec age agt gca ggc tec tea gga teg ctg tec egc ace eat 1504 
Glu ser ser Ser Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His 
380 385 390 

cca cct etc cag age aca ccc eta gtc tea ggt gtg gca get ggc tet 1552 
Pro Pro Leu Gin Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser 
395 400 405 

cca ggc tgt gtg cct tat cca gag aat gga ata ggg ggc cag gtt get 1600 
Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly He Gly Gly Gin Val Ala 
410 415 420 

ccc age age acc age tac ate etc ctt cca ctt gaa get gca aca ggc 1648 
Pro Ser Ser Thr Ser Tyr He Leu Leu Pro Leu Glu Ala Ala Thr Gly 
425 430 435 440 

ate ccg cct gga age ate ctt ctt aat cca cac aca ggc cag ccc ttt 1696 
He Pro Pro Gly Ser He Leu Leu Asn Pro His Thr Gly Gin Pro Phe 
445 450 455 

gtg aat ccc gat gga act cct gca ata tac aac cca ccc acc agt cag 1744 
Val Asn Pro Asp Gly Thr Pro Ala He Tyr Asn Pro Pro Thr Ser Gin 
460 465 470 

cag ccc ctg cga age gcc atg gtg ggg cag tec caa cag cag ccg cca 1792 
Gin Pro Leu Arg Ser Ala Met Val Gly Gin Ser Gin Gin Gin Pro Pro 
475 480 485 

eag cag cag ccc tec ccg cag ccc caa cag cag gtc cag cca ccg cag 1840 
Gin Gin Gin Pro Ser Pro Gin Pro Gin Gin Gin Val Gin Pro Pro Gin 
490 495 500 
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cca cag atg gca ggc cct ctg gtc act cag tct gtc cag ggg ctg cag 1888 
Pro Gin Met Ala Gly Pro lieu Val Thr Gin Ser Val Gin Gly Leu Gin 
505 510 515 520 

get tec tee cag tea gtg caa tat ccg gca gtc tct ttt cct ccc cag 1936 
Ala Ser Ser Gin Ser Val Gin Tyr Pro Ala Val Ser Phe Pro Pro Gin 
525 530 535 

cac etc eta cct gtg tct cca acg cag cac ttt ccc atg aga gat gat 1984 
His Leu Leu Pro Val Ser Pro Thr Gin His Phe Pro Met Arg Asp Asp 
540 545 550 

gtg gca aca cag ttt ggc cag atg acc ctg age egg cag tec teg ggg 2032 
Val Ala Thr Gin Phe Gly Gin Met Thr Leu Ser Arg Gin Ser Ser Gly 
555 560 565 

gag act cct gaa ccc cca tea ggt cct gtc tac cca tec tec ett atg 2080 
Glu Thr Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met 
570 575 580 

eea cag ccg gee cag cag ccc age tat gta ate gee tet aca ggc cag 2128 
Pro Gin Pro Ala Gin Gin Pro Ser Tyr Val lie Ala Ser Thr Gly Gin 

585 590 595 600 

cag ett cct aca gga gga tte tea ggc tet ggc cct ccc ate tec cag 2176 
Gin Leu Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro Pro He Ser Gin 
605 610 615 

cag gtc etc cag ccc cct ccc tea cca cag gga tty gtg caa cag cct 2224 
Gin Val Leu Gin Pro Pro Pro Ser Pro Gin Gly Phe Val Gin Gin Pro 
620 625 630 

ccg cct gca cag atg cct gta tat tat tac cca tct ggt cag tac cct 2272 
Pro Pro Ala Gin Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gin Tyr Pro 
635 640 645 

ace tea acc acg caa cag tac egg ccc atg gee ccg gtt cag tac aac 2320 
Thr Ser Thr Thr Gin Gin Tyr Arg Pro Met Ala Pro Val Gin Tyr Asn 
650 655 660 

get cag agg agt caa cag atg cca cag gca gca cag caa gca ggt tac 2368 
Ala Gin Arg Ser Gin Gin Met Pro Gin Ala Ala Gin Gin Ala Gly Tyr 
665 670 675 680 

cag cca gtc ttg tct ggt caa cag gga tte caa ggc eta ata gga gtg 2416 
Gin Pro Val Leu Ser Gly Gin Gin Gly Phe Gin Gly Leu He Gly Val 
685 690 695 

cag eag cca cct cag agt cag aac gtg ata aat aac caa caa gga act 2464 
Gin Gin Pro Pro Gin Ser Gin Asn Val He Asn Asn Gin Gin Gly Thr 
700 705 710 

ccg gtg caa age gtg atg gtt tee tac eea aca atg tet tct tat eag 2512 
Pro Val Gin Ser Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gin 
715 720 725 

gtg cca atg acc cag ggt tct caa gga ctg ccc cag cag tea tac caa 2560 
Val Pro Met Thr Gin Gly Ser Gin Gly Leu Pro Gin Gin Ser Tyr Gin 
730 735 740 

cag cca ate atg eta cct aac cag gca ggt caa ggg tea etc cca gee 2608 
Gin Pro He Met Leu Pro Asn Gin Ala Gly Gin Gly Ser Leu Pro Ala 

745 750 755 760 

act gga atg cct gtt tac tgt aat gtc aca ccg ccc acc cct cag aac 2656 
Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gin Asn 
765 770 775 

aac ett agg ctg att ggc cca cac tgc ccc tec age act gtc cca gtg 2704 
Asn Leu Arg Leu He Gly Pro His Cys Pro Ser Ser Thr Val Pro Val 
780 785 790 

atg tea get age tgc aga aca aac tgt gca agt atg age aat get ggt 2752 
Met Ser Ala Ser Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly 
795 800 805 

tgg cag gtc aaa tte tga gagctctggc tgtggtacat ttcttcagat 2800 
Trp Gin val Lys Phe 
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810 












atttctcatg 


gcctttgatg 


gaagaggaac aaggtgggaa 


aactggctga 


ggacttaagt 


2860 


attcactcaa 


cactcaaatg 


attgctgctg gtattctgta 


aaaartaaac 


aaagactaat 


2920 


atacacgtta 


gctggttaat 


ggtgcatatt tctgtcatgt 


ctgctaggta 


tgcctttata 


2980 


gcttagctag 


tgacatgaat 


tcatcaaggt aagattytct 


cctaccactg 


aataccactg 


3040 


tgtagattat 


aatatcccta 


atttggatta gttttgtact 


ttgtgttgag 


tttgtgatgc 


3100 


taaaagtatt 


taaaaattat 


atactaaatc acattgtacc aaagctgtaa 


tggaaaagca 


3160 


aagaagaayt 


gatgaattga 


aggaataatt tatatacatt 


atagagtttt 


cttttttaat 


3220 


ggatatatac 


tgtattgtag 


tgtttaatca aaataaaact 


atttgacctt 


atggaggaag 


3280 


gtcatgtttt 


taaaaaaaaa 


aaaaaaaaaa aaaaaaaaaa 


aaaaaaaaaa 


aa 


3332 



<210> 48 

<211> 813 

<212> PRT 

<213> Hcsmo sapiens 

<400> 48 

Met Ser 61u Gin Gly Asp Leu Asn Gin 2lla lie Ala Glu Glu Gly Gly 
15 10 15 

Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly lie Val Lys Ser Glu 
20 25 30 

Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Glu Ala 
35 40 45 



Gin Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 
50 55 60 



Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 
65 70 75 80 



Gly Glu Ser Leu Gin Asp Gin Glu Ser lie His Leu Gin Leu Ser Ser 
85 90 95 



Phe Ser Ser Leu Gin Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 
100 105 110 



Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 
115 120 125 



Lys lie Arg Met Leu Ser Lys Asp Cys Ser Gin Glu Tyr Thr Asp Ser 
130 135 140 



Thr Gly lie Asp Leu His Glu Phe Leu He Asn Thr Leu Lys Asn Asn 
145 150 155 160 



Ser Arg Asp Arg Met He Leu Leu Lys Met Glu Gin Glu He He Asp 
165 170 175 



Phe He Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gin Met Ser 
180 185 190 



Ser Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 
195 200 205 
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Asp His Asn Val Asp Gin Thr Gly Lys Ser Val He He Asn Lys Thr 
210 215 220 

Ser Ser Thr Arg He Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp 
225 230 235 240 

Glu Lys Gly Glu Glu Ser Gin Lys Arg Phe He Leu Lys Arg Asp Asn 
245 250 255 

Ser Ser He Asp Lys Glu Asp Asn Gin Ser Val Cys Ser Gin Glu Ser 
260 265 270 

Leu Phe Val Glu Asn Ser Arg Leu Leu Glu Asp Ser Asn He Cys Asn 
275 280 285 

Glu Thr Tyr Lys Lys Arg Gin Leu Phe Arg Gly Asn Arg Asp Gly Ser 
290 295 300 

Gly Arg Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu Asn Glu Leu Lys 
305 310 315 320 

Trp Ser Asp His Gin Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser 
325 330 335 

Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly 
340 345 350 

He Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr 
355 360 365 

Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser 
370 375 380 

Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gin Ser Thr Pro Leu 
385 390 395 400 

Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu 
405 410 415 



Asn Gly He Gly Gly Gin Val Ala Pro Ser Ser Thr Ser Tyr He Leu 
420 425 430 



Leu Pro Leu Glu Ala Ala Thr Gly He Pro Pro Gly Ser He Leu Leu 
435 440 445 



Asn Pro His Thr Gly Gin Pro Phe Val Asn Pro Asp Gly Thr Pro Ala 
450 455 460 



He Tyr Asn Pro Pro Thr Ser Gin Gin Pro Leu Arg Ser Ala Met Val 
465 470 475 480 



Gly Gin Ser Gin Gin Gin Pro Pro Gin Gin Gin Pro Ser Pro Gin Pro 
485 490 495 



Gin Gin Gin Val Gin Pro Pro Gin Pro Gin Met Ala Gly Pro Leu Val 
500 505 510 

Thr Gin Ser Val Gin Gly Leu Gin Ala Ser Ser Gin Ser Val Gin Tyr 
515 520 525 
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Pro Ala Val Ser Phe Pro Pro Gin His Leu Leu Pro Val Ser Pro Thr 
530 535 540 

Gin His Phe Pro Met Arg Asp Asp Val Ala Thr Gin Phe Gly Gin Met 
545 550 555 560 

Thr Leu Ser Arg Gin Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly 
565 570 575 

Pro Val Tyr Pro Ser Ser Leu Met Pro Gin Pro Ala Gin Gin Pro Ser 
580 585 590 

Tyr Val He Ala Ser Thr Gly Gin Gin Leu Pro Thr Gly Gly Phe Ser 
595 600 605 

Gly Ser Gly Pro Pro He Ser Gin Gin Val Leu Gin Pro Pro Pro Ser 
610 615 620 

Pro Gin Gly Phe Val Gin Gin Pro Pro Pro Ala Gin Met Pro Val Tyr 
625 630 635 640 

Tyr Tyr Pro Ser Gly Gin Tyr Pro Thr Ser Thr Thr Gin Gin Tyr Arg 
645 650 655 

Pro Met Ala Pro Val Gin Tyr Asn Ala Gin Arg Ser Gin Gin Met Pro 
660 665 670 

Gin Ala Ala Gin Gin Ala Gly Tyr Gin Pro Val Leu Ser Gly Gin Gin 
675 680 685 

Gly Phe Gin Gly Leu He Gly Val Gin Gin Pro Pro Gin Ser Gin Asn 
690 695 700 

Val He Asn Asn Gin Gin Gly Thr Pro Val Gin Ser Val Met Val Ser 
705 710 . 715 720 

Tyr Pro Thr Met Ser Ser Tyr Gin Val Pro Met Thr Gin Gly Ser Gin 
725 730 735 

Gly Leu Pro Gin Gin Ser Tyr Gin Gin Pro He Met Leu Pro Asn Gin 
740 745 750 

Ala Gly Gin Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn 
755 760 765 

Val Thr Pro Pro Thr Pro Gin Asn Asn Leu Arg Leu He Gly Pro His 
770 775 780 

Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn 
785 790 795 800 

Cys Ala Ser Met Ser Asn Ala Gly Trp Gin Val Lys Phe 
805 810 



<210> 49 

<211> 3272 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (329) . . (2710) 

<223> 



<400> 49 
gtctattttt 

tgagggaaga 

acgatgtgtc 

ggacatggga 

gaagccgcta 

cttgttattt 



aatgctattt 
cagagcagtg 
acactgtgta 
ctccagttgt 
gtaagtcgcc 
taatttgcat 



160 100 PCT.ST25 



aatgaaggag cgagcgcctc actcagcaat aaaagaagca 

catggttatg gatactggac aaggatattt ggaaaggttg 

agggaatcgc atggagatgg gcattccgaa ctgttaatgg 

ctctgatcac ttgtgtggat tttcctggcg tagaacgaca 

aagacctaca gcaggaattc tgcaccaaag ggcataaaat 

ctgggaga atg tct gag -caa gga gac ctg aat 
Met Ser Glu Gin Gly Asp i*eu Asn 
1 5 



cag gca ata gca gag gaa gga ggg act gag cag gag acg gcc act cca 
Gin Ala lie Ala Glu Glu Gly Gly Thr Glu Gin Glu Thr Ala Thr Pro 
10 15 20 

gag aac ggc att gtt aaa tea gaa agt ctg. gat gaa gag gag aaa ctg 
Glu Asn Gly lie Val Lys Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu 
25 30 35 40 

gaa ctg cag agg egg ctg gag get cag aat caa gaa aga aga aaa tec 
Glu Leu Gin Arg Arg Leu Glu Ala Gin Asn Gin Glu Arg Arg Lys Ser 
45 50 55 

aag tea gga gca gga aaa ggt aaa ctg act cgc agt ett get gtc tgt 
Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu Ala Val Cys 
60 65 70 

gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag gat cag gaa 
Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gin Asp Gin Glu 
75 80 85 

tea att cat tta cag ctt tec agt ttt tec age ctg caa gag gag gat 
Ser He His Leu Gin Leu Ser Ser Phe Ser Ser Leu Gin Glu Glu Asp 
90 95 100 

aaa tct agg aaa gat gac tct gaa aga gaa aaa gaa aag gat aaa aac 
Lys Ser Arg Lys Asp Asp Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn 
.105 110 115 120 

aaa gat aaa acc tet gaa aaa cec aag ate aga atg tta tea aaa gat 
Lys Asp Lys Thr Ser Glu Lys Pro Lys He Arg Met Leu Ser Lys Asp 
125 130 135 

tgc age caa gaa tac acg gat tct aca ggc ata gac tta cac gag ttt 
Cys Ser Gin Glu Tyr Thr Asp Ser Thr Gly He Asp Leu His Glu Phe 
140 145 150 

ctg att aac aca tta aag aat aat tec agg gac agg atg ata ctt ttg 
Leu He J^n Thr Leu Lys Asn Asn Ser Arg Asp Arg Met He Leu Leu 
155 160 165 

aaa atg gag cag gaa att att gat ttc att get gac aac aat aat cat 
Lys Met Glu Gin Glu He He Asp Phe He Ala Asp Asn Asn Asn His 
170 175 180 

tat aaa aag ttc cct cag atg tea teg tat cag agg atg ctt gtc cat 
Tyr Lys Lys Phe Pro Gin Met Ser Ser Tyr Gin Arg Met Leu Val His 
185 190 195 200 

cga gtg gca get tat ttt gga ttg gat cac aat gtg gat caa aca gga 
Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val Asp Gin Thr Gly 
205 210 215 

aaa tct gtt ate ate aac aag acc age age ace aga ata cca gag caa 
Lys Ser Val He He Asn Lys Thr Ser Ser Thr Arg He Pro Glu Gin 

220 ■ 225 230 

agg ttt tgt gaa cat tta aaa gat gaa aaa ggt gaa gaa tec cag aag 
Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu Glu Ser Gin Lys 
235 240 245 



60 
120 
180 
240 
300 
352 

400 

448 

496 

544 

592 

640 

688 

736 

784 

832 

880 

928 

976 

1024 

1072 
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egg ttt ate ttg aag cga gat aac tct agt att gat aaa gaa gac aat 1120 
Arg Phe lie Leu Lys Arg Asp Asn Ser Ser lie Asp Lys Glu Asp Asn 
250 255 260 

eag tea gtt tgc tec eag gaa age ctt ttt gtg gaa aae agg gge aac 1168 
Gin Ser Val Cys Ser Gin Glu Ser Leu Phe Val Glu Asn Arg Gly Asn 
265 270 275 280 

aga gat ggc tea ggg aga aca tct ggg agt cga cag age age tea gaa 1216 
Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu 
285 290 295 

aat gaa etc aag tgg tct gae cae caa agg gee tgg age age aea gac 1264 
Asn Glu Leu Lys Trp Ser Asp His Gin Arg Ala Trp Ser Ser Thr Asp 
300 305 310 

tec gac agt tec aac cgc aat eta aag ccc gee atg ace aag acg gcg 1312 
Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala 
315 320 325 

agt ttt ggg ggc ate acg gtg ctg ace agg ggt gac age act tec agt 1360 
Ser Phe Gly Gly lie Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser 
330 335 340 

act agg agt acc ggg aag ctg tec aaa gea ggt tec gag tct tee age 1408 
Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser 
345 350 355 360 

agt gca ggc tec tea gga teg ctg tec cgc acc eat cca cet etc eag 1456 
Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gin 
365 370 375 

age aca ccc eta gtc tea ggt gtg gca get ggc tct cca ggc tgt gtg 1504 
Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val 
380 385 390 

cet tat cca gag aat gga ata ggg ggc cag gtt get ccc age age acc 1552 
Pro Tyr Pro Glu Asn Gly lie Gly Gly Gin Val Ala Pro Ser Ser Thr 
395 400 405 

age tac ate etc ctt cca ctt gaa get gca aea gge ate eeg cet gga 1600 
Ser Tyr lie Leu Leu Pro Leu Glu Ala Ala Thr Gly lie Pro Pro Gly 
410 415 420 

age ate ctt ctt aat cca cae aca ggc cag ccc ttt gtg aat ccc gat 1648 
Ser lie Leu Leu Asn Pro His Thr Gly Gin Pro Phe Val Asn Pro Asp 
425 430 435 440 

gga act cet gca ata tac aac cca ccc acc agt cag cag ccc ctg cga 1696 
Gly Thr Pro Ala He Tyr Asn Pro Pro Thr Ser Gin Gin Pro Leu Arg 
445 450 455 

age gee atg gtg ggg cag tee caa cag cag eeg cca cag cag eag ccc 1744 
Ser Ala Met Val Gly Gin Ser Gin Gin Gin Pro Pro Gin Gin Gin Pro 
460 465 470 

tec eeg cag ccc caa cag cag gtc cag cca eeg cag cca cag atg gca 1792 
Ser Pro Gin Pro Gin Gin Gin Val Gin Pro Pro Gin Pro Gin Met Ala 
475 480 485 

ggc cet ctg gtc act cag tct gtc cag ggg ctg cag get tec tec cag 1840 
Gly Pro Leu Val Thr Gin Ser Val Gin Gly Leu Gin Ala Ser Ser Gin 
490 495 500 

tea gtg caa tat ccg gea gtc tct ttt cet ccc cag cac etc eta cet 1888 
Ser val Gin Tyr Pro Ala Val Ser Phe Pro Pro Gin His Leu Leu Pro 
505 510 515 520 

gtg tct cca acg cag cac ttt ccc atg aga gat gat gtg gca aca cag 1936 
Val Ser Pro Thr Gin His Phe Pro Met Arg Asp Asp Val Ala Thr Gin 
525 530 535 

ttt ggc eag atg acc ctg age egg cag tec teg ggg gag act cet gaa 1984 
Phe Gly Gin Met Thr Leu Ser Arg Gin Ser Ser Gly Glu Thr Pro Glu 
540 545 550 

ccc cca tea ggt cet gtc tac cca tec tec ctt atg cca cag ccg gee 2032 
Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gin Pro Ala 
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555 560 565 

cag cag ccc age tat gta ate gcc tct aca ggc cag cag ctt cct aca 2080 
Gin Gin Pro Ser Tyr Val He Ala Ser Thr Gly Gin Gin Leu Pro Thr 
570 575 580 

gga gga ttc tea ggc tct ggc cct ccc ate tec cag cag gtc etc cag 2128 
Gly Gly Phe Ser Gly Ser Gly Pro Pro He Ser Gin Gin Val Leu Gin 
585 590 595 600 

ccc cct ccc tea cca cag gga tty gtg caa cag cct ccg cct gca cag 2176 
Pro Pro Pro Ser Pro Gin Gly Phe Val Gin Gin Pro Pro Pro Ala Gin 
605 610 615 

atg cct gta tat tat tac cca tct ggt cag tae cct ace tea ace acg 2224 
Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gin Tyr Pro Thr Ser Thr Thr 
620 625 630 

caa cag tac egg ccc atg gee ccg gtt cag tae aac get cag agg agt 2272 
Gin Gin Tyr Arg Pro Met Ala Pro Val Gin Tyr Asn Ala Gin Arg Ser 
635 640 645 

caa cag atg cca cag gca gca cag caa gca ggt tae cag cca gtc ttg 2320 
Gin Gin Met Pro Gin Ala Ala Gin Gin Ala Gly Tyr Gin Pro Val Leu 
650 655 660 

tct ggt caa cag gga ttc caa ggc eta ata gga gtg cag cag cca cct 2368 
Ser Gly Gin Gin Gly Phe Gin Gly Leu He Gly Val Gin Gin Pro Pro 
665 670 675 680 

cag agt cag aac gtg ata aat aac caa caa gga act ccg gtg caa age 2416 
Gin Ser Gin Asn Val He Asn Asn Gin Gin Gly Thr Pro Val Gin Ser 
685 690 695 

gtg atg gtt tec tac cca aca atg tct tct tat cag gtg cca atg acc 24 64 

Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gin Val Pro Met Thr 
700 705 710 

cag ggt tct caa gga ctg ccc cag cag tea tac caa cag cca ate atg 2512 
Gin Gly Ser Gin Gly Leu Pro Gin Gin Ser Tyr Gin Gin Pro He Met 
715 720 725 

eta cct aac cag gca ggt caa ggg tea etc cca gee act gga atg cct 2560 
Leu Pro Asn Gin Ala Gly Gin Gly Ser Leu Pro Ala Thr Gly Met Pro 
730 735 740 

gtt tae tgt aat gtc aca ccg ece acc cct cag aac aac ctt agg ctg 2608 
Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gin Asn Asn Leu Arg Leu 
745 750 755 760 

att ggc cca eac tgc ccc tec age act gtc cca gtg atg tea get age 2656 
He Gly Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser 
765 770 775 

tgc aga aca aac tgt gca agt atg age aat get ggt tgg cag gtc aaa 2704 
Cys Arg Thr Asn cys Ala Ser Met Ser Asn Ala Gly Trp Gin Val Lys 
780 785 790 

ttc tga gagetetggc tgtggtaeat ttettcagat atttctcatg gcctttgatg 2760 
Phe 

gaagaggaae aaggtgggaa aactggetga ggacttaagt attcactcaa cactcaaatg 2820 

attgctgctg gtattetgta aaaartaaac aaagaetaat atacacgtta gctggttaat 2880 

ggt gca tat t tetgteatgt ctgctaggta tgcetttata gcttagetag tgaeatgaat 2940 

teateaaggt aagattytct cetaccaetg aataccactg tgtagattat aatatcccta 3000 

atttggatta gttttgtact ttgtgttgag tttgtgatge taaaagtatt taaaaattat 3060 

atactaaatc acattgtace aaagctgtaa tggaaaagca aagaagaayt gatgaattga 3120 

aggaataatt tatatacatt atagagtttt cttttttaat ggatatatac tgtattgtag 3180 

tgtttaatca aaataaaact atttgacctt atggaggaag gteatgtttt taaaaaaaaa 3240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3272 
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<210> 50 

<211> 793 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Met Ser Glu Gin Gly Asp Leu Asn Gin Ala lie Ala Glu Glu Gly Gly 
15 10 15 

Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly lie Val Lys Ser Glu 
20 25 30 

Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu GXn Arg Arg Leu Glu Ala 
35 40 45 

Gin Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 
50 55 60 

Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 
65 70 75 80 

Gly Glu Ser Leu Gin Asp Gin Glu Ser lie His Leu Gin Leu Ser Ser 
85 90 95 

Phe Ser Ser Leu Gin Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 
100 105 110 

Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 
115 120 125 

Lys lie Arg Met Leu Ser Lys Asp Cys Ser Gin Glu Tyr Thr Asp Ser 
130 135 140 

Thr Gly lie Asp Leu His Glu Phe Leu He Asn Thr Leu Lys Asn Asn 
145 150 155 160 

Ser Arg Asp Arg Met He Leu Leu Lys Met Glu Gin Glu He He Asp 
165 170 175 

Phe He Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gin Met Ser 
180 185 190 

Ser Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 
195 200 205 

Asp His Asn Val Asp Gin Thr Gly Lys Ser Val He He Asn Lys Thr 
210 215 220 

Ser Ser Thr Arg He Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp 
225 230 235 240 

Glu Lys Gly Glu Glu Ser Gin Lys Arg Phe He Leu Lys Arg Asp Asn 
245 250 255 

Ser Ser He Asp Lys Glu Asp Asn Gin Ser Val Cys Ser Gin Glu Ser 
260 265 270 

Leu Phe Val Glu Asn Arg Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser 
275 280 285 
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Gly Ser Arg Gin Ser Ser Ser Glu Asn Glu lieu Lys Trp Ser Asp His 
290 295 300 

Gin Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu 
305 310 315 320 

Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly He Thr Val Leu 
325 330 335 

Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser 
340 345 350 

Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu 
355 360 365 

Ser Arg Thr His Pro Pro Leu Gin Ser Thr Pro Leu Val Ser Gly Val 
370 375 380 

Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly He Gly 
385 390 395 400 

Gly Gin Val Ala Pro Ser Ser Thr Ser Tyr He Leu Leu Pro Leu Glu 
405 410 415 

Ala Ala Thr Gly He Pro Pro Gly Ser He Leu Leu Asn Pro His Thr 
420 425 430 



Gly Gin Pro Phe Val Asn Pro Asp Gly Thr Pro Ala He Tyr Asn Pro 
435 440 445 

Pro Thr Ser Gin Gin Pro Leu Arg Ser Ala Met Val Gly Gin Ser Gin 
450 455 460 

Gin Gin Pro Pro Gin Gin Gin Pro Ser Pro Gin Pro Gin Gin Gin Val 
465 470 475 480 

Gin Pro Pro Gin Pro Gin Met Ala Gly Pro Leu Val Thr Gin Ser Val 
485 490 495 



Gin Gly Leu Gin Ala Ser Ser Gin Ser Val Gin Tyr Pro Ala Val Ser 
500 505 510 

Phe Pro Pro Gin His Leu Leu Pro Val Ser Pro Thr Gin His Phe Pro 
515 520 525 



Met Arg Asp Asp Val Ala Thr Gin Phe Gly Gin Met Thr Leu Ser Arg 
530 535 540 



Gin Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro 

545 550 555 560 

Ser Ser Leu Met Pro Gin Pro Ala Gin Gin Pro Ser Tyr Val He Ala 

565 570 575 



Ser Thr Gly Gin Gin Leu Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro 
580 585 590 



Pro He Ser Gin Gin Val Leu Gin Pro Pro Pro Ser Pro Gin Gly Phe 
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595 600 605 

Val Gin Gin Pro Pro Pro Ala Gin Met Pro Val Tyr Tyr Tyr Pro Ser 
610 615 620 

Gly Gin Tyr Pro Thr Ser Thr Thr Gin Gin Tyr Arg Pro Met Ala Pro 
625 630 635 640 



val Gin Tyr Asn Ala Gin Arg Ser Gin Gin Met Pro Gin Ala Ala Gin 
645 650 655 



Gin Ala Gly Tyr Gin Pro Val Leu Ser Gly Gin Gin Gly Phe Gin Gly 
660 665 670 



Leu lie Gly Val Gin Gin Pro Pro Gin Ser Gin Asn Val lie Asn Asn 
675 680 685 



Gin Gin Gly Thr Pro Val Gin Ser Val Met Val Ser Tyr Pro Thr Met 
690 695 700 



Ser Ser Tyr Gin Val Pro Met Thr Gin Gly Ser Gin Gly Leu Pro Gin 
705 710 715 720 

Gin Ser Tyr Gin Gin Pro lie Met Leu Pro Asn Gin Ala Gly Gin Gly 
725 730 735 

Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro 
740 745 750 

Thr Pro Gin Asn Asn Leu Arg Leu lie Gly Pro His Cys Pro Ser Ser 
755 760 765 

Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala Ser Met 
770 775 780 



Ser Asn Ala Gly Trp Gin Val Lys Phe 
785 790 



<210> 51 

<211> 1006 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (260).. (549) 

<223> 

<400> 51 



gggcagcttg 


agacaggtgg 


agctggatca 


agctgtgaac gtgatttgct ggaagctggt 


60 


cattagtgtt 


gacgatgtgt 


cacactgtgt 


aagggaatcg catggagatg ggcattccga 


120 


actgttaatg 


gggacatggg 


actccagttg 


tctctgatca cttgtgtgga ttttcctggc 


180 


gtagaacgac 


agaagccgct 


agtaagtcgc caagacctac agcaggaatt ctgcaccaaa 


240 


gggcataaaa 


tcttgttatt 


ttaatttgca 


tctgggaga atg tct gag caa gga 
Met Ser Glu Gin Gly 
1 5 


294 



gac ctg aat cag gca at a gca gag gaa gga ggg act gag cag gag acg 342 
Asp Leu Asn Gin Ala He Ala Glu Glu Gly Gly Thr Glu Gin Glu Thr 
10 15 20 

gcc act cca gag aac ggc att gtt aaa tea gaa agt ctg gat gaa gag 390 

Page 81 



wo 03/085095 



PCTAJS03/09921 



16U 100 PCT.ST25 

Ala Thr Pro Glu Asn Gly lie Val Lys Sex Glu Ser Leu Asp Glu Glu 
25 30 35 

gag aaa ctg gaa ctg cag agg egg ctg gag get eag aat caa gaa aga 438 
Glu Lys Leu Glu Leu Gin Arg Arg Leu Glu Ala Gin Asn Gin Glu Arg 
40 45 50 

aga aaa tec aag tea gga gca gga aaa ggt aaa ctg act cgc agt ctt 486 
Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu 
55 60 65 

get gtc tgt gag gaa tct tot gee aga cca gga ggt gaa agt ett cag 534 
Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gin 
70 75 80 85 

gat cag act etc tga aaactgcaaa tggaaaggaa ttcaaaagaa tttagattaa 589 
Asp Gin Thr Leu 



aagttaaata 


aaaagtaggc 


acagtagtgc 


tgaattttcc 


tcaaaggctc 


tcttttgata 


649 


aggctgaacc 


aaatataatc 


ccaagtatcc 


tctctccttc 


cttgttggag 


atgtcttacc 


709 


tctcagctcc 


caaaatgcac 


ttgcctataa 


gaaacacaat 


tgctggttca 


tatgaaactt 


769 


wagaaatagt 


gaataaggtg 


catttaactt 


tggagaaata 


cttttatgsc 


tttggtggag 


829 


atttctcaat 


actgcaaaag 


ttgtccagaa 


atgaatctga gctgatggtg 


actttaagtt 


889 


aatattatta 


atatatcact 


gcatattttt 


acccttattt 


ttgctcctta 


cagcaagatt 


949 


agtaggttat 


aaaaatttaa 


atttaaacaa 


aattatttca 


tgacaaaatg 


ggaaact 


1006 



<210> 52 

<211> 89 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Met Ser Glu Gin Gly Asp Leu Asn Gin Ala lie Ala Glu Glu Gly Gly 
15 10 15 



Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly lie Val ' Lys Ser Glu 
20 25 30 



Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Glu Ala 
35 40 45 



Gin Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 
50 55 60 



Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 
65 70 75 80 



Gly Glu Ser Leu Gin Asp Gin Thr Leu 
85 



<210> 53 

<211> 807 

<212> PRT 

<213> Mus mas cuius 

<400> 53 

Met Ser Glu Gin Gly Gly Leu Thr Pro Thr He Leu Glu Glu Gly Gin 
1 5 10 • 15 

Thr Glu Pro Glu Ser Ala Pro Glu Asn Gly He Leu Lys Ser Glu Ser 
20 25 30 



Page 82 



wo 03/085095 



PCTAJS03/09921 



16n 100 PCT.ST25 

Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Ala Ala Gin 
35 40 45 

Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys Leu 
50 55 60 

Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Ser Gly Gly 
65 70 75 80 

Glu Ser His Gin Asp Gin Glu Ser lie His Leu Gin Leu Ser Ser Phe 
85 90 95 

Pro Ser Leu Gin Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu Arg 
' 100 105 110 

Glu Lys Glu Lys Asp Lys Asn Arg Glu Lys Leu Ser Glu Arg Pro Lys 
115 120 125 

lie Arg Met Leu Ser Lys Asp Cys Ser Gin Glu Tyr Thr Asp Ser Thr 
130 135 140 

Gly lie Asp Leu His Gly Phe Leu lie Asn Thr Leu Lys Asn Asn Ser 
145 150 155 160 

Arg Asp Arg Met lie Leu Leu Lys Met Glu Gin Glu Met lie Asp Phe 
165 170 175 

lie Ala Asp Ser Asn Asn His Tyr Lys Lys Phe Pro Gin Met Ser Ser 
180 185 190 

Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu Asp 
195 200 205 

His Asn Val Asp Gin Thr Gly Lys Ser Val He He Asn Lys Thr Ser 
210 215 220 

Ser Thr Arg He Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp Glu 
225 230 235 240 

Lys Ser Glu Glu Ser Gin Lys Arg Phe He Leu Lys Arg Asp Asn Ser 
245 250 255 

Ser He Asp Lys Glu Asp Asn Gin Asn Arg Met His Pro Phe Arg Asp 
260 265 270 

Asp Arg Arg Ser Lys Ser He Glu Glu Arg Glu Glu Glu Tyr Gin Arg 
275 280 285 

Val Arg Glu Arg He Phe Ala His Asp Ser Val Cys Ser Gin Glu Ser 
290 295 300 

Leu Phe Leu Asp Asn Ser Arg Leu Gin Glu Asp Met His He Cys Asn 
305 310 315 320 

Glu Thr Tyr Lys Lys Arg Gin Leu Phe Arg Ala His Arg Asp Ser Ser 
325 330 335 

Gly Arg Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu Thr Glu Leu Arg 
340 345 350 
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Txp Pro Asp His Gin Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser 
355 360 365 

Asn Arg Asn Leu Lys Pro Thr Met Thr Lys Thr Ala Ser Phe Gly Gly 
370 375 380 

lie Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Ala 
385 390 395 400 

Gly Lys Leu Ser Lys Thr Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser 
405 410 415 

Ser Gly Ser Leu Ser Arg Thr His Pro Gin Ser Thr Ala Leu Thr Ser 
420 425 430 

Ser Val Ala RXa Gly Ser Pro Gly Cys Met Ala Tyr Ser Glu Asn Gly 
435 440 445 

Met Gly Gly Gin Val Pro Pro Ser Ser Thr Ser Tyr lie Leu Leu Pro 
450 455 460 

Leu Glu Ser Ala Thr Gly He Pro Pro Gly Ser He Leu Leu Asn Pro 
465 470 475 480 

His Thr Gly Gin Pro Phe Val Asn Pro Asp Gly Thr Pro Ala He Tyr 
465 490 495 

Asn Pro Pro Gly Ser Gin Gin Thr Leu Arg Gly Thr Val Gly Gly Gin 
500 505 510 

Pro Gin Gin Pro Pro Gin Gin Gin Pro Ser Pro Gin Pro Gin Gin Gin 
515 520 525 

Val Gin Ala Ser Gin Pro Gin Met Ala Gly Pro Leu Val Thr Gin Arg 
530 535 540 

Glu Glu Leu Ala Ala Gin Phe Ser Gin Leu Ser Met Ser Arg Gin Ser 
545 550 555 560 

Ser Gly Asp Thr Pro Glu Pro Pro Ser Gly Thr Val Tyr Pro Ala Ser 
565 570 575 

Leu Leu Pro Gin Thr Ala Gin Pro Gin Ser Tyr Val He Thr Ser Ala 
580 585 590 

Gly Gin Gin Leu Ser Thr Gly Gly Phe Ser Asp Ser Gly Pro Pro He 
595 600 605 

Ser Gin Gin Val Leu Gin Ala Pro Pro Ser Pro Gin Gly Phe Val Gin 
610 615 620 

Gin Pro Pro Pro Ala Gin Met Ser Val Tyr Tyr Tyr Pro Ser Gly Gin 
.625 630 635 640 

Tyr Pro Thr Ser Thr Ser Gin Gin Tyr Arg Pro Leu Ala Ser Val Gin 
645 650 655 



Tyr Ser Ala Gin Arg Ser Gin Gin He Pro Gin Thr Thr Gin Gin Ala 
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660 665 670 

Gly Tyr Gin Pro Val Leu Ser Gly Gin Gin Gly Phe Gin Gly Met Met 
675 680 685 

Gly Val Gin Gin Ser Ala His Ser Gin Gly Val Met Ser Ser Gin Gin 
690 695 700 

Gly Ala Pro Val His Gly Val Met Val Ser Tyr Pro Thr Met Ser Ser 
705 710 715 720 

Tyr Gin Val Pro Met Thr Gin Gly Ser Gin Ala Val Pro Gin Gin Thr 
725 730 735 

Tyr Gin Pro Pro lie Met Leu Pro Ser Gin Ala Gly Gin Gly Ser Leu 
740 745 750 

Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro Asn Pro 
755 760 765 

Gin Asn Asn Leu Arg Leu Met Gly Pro His Cys Pro Ser Ser Thr Val 
770 775 780 

Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Gly Asn Val Ser Asn 
785 790 795 800 

Ala Gly Trp Gin Val Lys Phe 
805 

<210> 54 

<211> 648 

<212> PRT 

<213> Homo sapien 

<400> 54 

Met He Leu Leu Lys Met Glu Gin Glu He He Asp Phe He Ala Asp 
15 10 15 

Asn Asn Asn His Tyr Lys Lys Phe Pro Gin Met Ser Ser Tyr Gin Arg 
20 25 30 

Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val 
35 40 45 

Asp Gin Thr Gly Lys Ser Val He He Asn Lys Thr Ser Ser Thr Arg 
50 55 60 

He Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu 
65 70 75 80 

Glu Ser Gin Lys Arg Phe He Leu Lys Arg Asp Asn Ser Ser He Asp 
85 90 95 

Lys Glu Asp Asn Gin Ser Val Cys Ser Gin Glu Ser Leu Phe Val Glu 
100 105 110 

Asn Arg Leu Leu Glu Asp Ser Asn He Cys Asn Glu Thr Tyr Lys Lys 
115 120 125 



Arg Gin Leu Phe Arg Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser Gly 
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130 135 140 

Ser Arg Gin Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser Asp His Gin 
145 150 155 160 

Arg Ala Tzp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu Lys 
165 170 175 

Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly lie Thr Val Leu Thr 
180 185 190 

Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser Lys 
195 200 205 

Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu Ser 
210 215 220 

Arg Thr His Pro Pro Leu Gin Ser Thr Pro Leu Val Ser Gly Val Ala 
225 230 235 240 

Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly He Gly Gly 
245 250 255 

Gin Val Ala Pro Ser Ser Thr Ser Tyr He Leu Leu Pro Leu Glu Ala 
260 265 270 

Ala Thr Gly He Pro Pro Gly Ser He Leu Leu Asn Pro His Thr Gly 
275 280 285 

Gin Pro Phe Val Asn Pro Asp Gly Thr Pro Ala He Tyr Asn Pro Pro 
290 295 300 

Thr Ser Gin Gin Pro Leu Arg Ser Ala Met Val Gly Gin Ser Gin Gin 
305 310 315 320 

Gin Pro Pro Gin Gin Gin Pro Ser Pro Gin Pro Gin Gin Gin Val Gin 
325 330 335 

Pro Pro Gin Pro Gin Met Ala Gly Pro Leu Val Thr Gin Ser Val Gin 
340 345 350 

Gly Leu Gin Ala Ser Ser Gin Ser val Gin Tyr Pro Ala Val Ser Phe 
355 360 365 

Pro Pro Gin His Leu Leu Pro Val Ser Pro Thr Gin His Phe Pro Met 
370 375 380 

Arg Asp Asp Val Ala Thr Gin Phe Gly Gin Met Thr Leu Ser Arg Gin 
385 390 395 400 

Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro Ser 
405 410 415 

Ser Leu Met Pro Gin Pro Ala Gin Gin Pro Ser Tyr Val He Ala Ser 
420 425 430 

Thr Gly Gin Gin Leu Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro Pro 
435 440 445 
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lie ser Gin Gin Val Leu 61a Pro Pro Pro Ser Pro Gin Gly Phe Val 
450 455 460 



Gin Gin Pro Pro Pro Ala Gin Met Pro Val Tyr Tyr Tyr Pro Ser Gly 
465 470 475 480 



Gin Tyr Pro Thr Ser Thr Thr Gin Gin Tyr Arg Pro Met Ala Pro Val 
485 490 495 

Gin Tyr Asn Ala Gin Arg Ser Gin Gin Met Pro Gin Ala Ala Gin Gin 
500 505 510 

Ala Gly Tyr Gin Pro Val Leu Ser Gly Gin Gin Gly Phe Gin Gly Leu 
515 520 525 



lie Gly Val Gin Gin Pro Pro Gin Ser Gin Asn Val He Asn Asn Gin 
530 535 540 



Gin Gly Thr Pro Val Gin Ser Val Met Val Ser Tyr Pro Thr Met Ser 
545 550 555 560 

Ser Tyr Gin Val Pro Met Thr Gin Gly Ser Gin Gly Leu Pro Gin Gin 
565 570 575 



Ser Tyr Gin Gin Pro lie Met Leu Pro Asn Gin Ala Gly Gin Gly Ser 
580 585 590 



Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro Thr 
595 600 605 

Pro Gin Asn Asn Leu Arg Leu He Gly Pro His Cys Pro Ser Ser Thr 
610 615 620 

Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala Ser Met Ser 
625 630 635 640 

Asn Ala Gly Trp Gin Val Lys Phe 
645 



<210> 55 

<211> 651 

<212> PRT 

<213> Homo sapien 

<400> 55 

Arg Asp Arg Met He Leu Leu Lys Met Glu Gin Glu He He Asp Phe 
15 10 15 

He Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gin Met Ser Ser 
20 25 30 

Tyr Gin Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu Asp 
35 40 45 

His Asn Val Asp Gin Thr Gly Lys Ser Val He He Asn Lys Thr Ser 
50 55 60 

Ser Thr Arg He Pro Glu Gin Arg Phe Cys Glu His Leu Lys Asp Glu 
65 70 75 80 
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Lys Gly Glu Glu Ser Gin Lys Arg Phe lie Leu Lys Arg Asp Asn Ser 
85 90 95 

Ser lie Asp Lys Glu Asp Asn Gin Ser Val Cys Ser Gin Glu Ser Leu 
100 105 110 

Phe Val Glu Asn Arg Leu Leu Glu Asp Ser Asn lie Cys Asn Glu Thr 
115 120 125 

Tyr Lys Lys Arg Gin Leu Phe Arg Gly Asn Arg Asp Gly Ser Gly Arg 
130 135 140 

Thr Ser Gly Ser Arg Gin Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser 
145 150 155 160 

Asp His Gin Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg 
165 170 175 

Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly lie Thr 
180 185 190 

Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys 
195 200 205 

Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly 
210 215 220 

Ser Leu Ser Arg Thr His Pro Pro Leu Gin Ser Thr Pro Leu Val Ser 
225 230 235 240 

Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly 
245 250 255 

lie Gly Gly Gin Val Ala Pro Ser Ser Thr Ser Tyr lie Leu Leu Pro 
260 265 270 

Leu Glu Ala Ala Thr Gly He Pro Pro Gly Ser He Leu Leu Asn Pro 
275 280 285 

His Thr Gly Gin Pro Phe Val Asn Pro Asp Gly Thr Pro Ala He Tyr 
290 295 300 

Asn Pro Pro Thr Ser Gin Gin Pro Leu Arg Ser Ala Met Val Gly Gin 
305 310 315 320 

Ser Gin Gin Gin Pro Pro Gin Gin Gin Pro Ser Pro Gin Pro Gin Gin 
325 330 335 

Gin Val Gin Pro Pro Gin Pro Gin Met Ala Gly Pro Leu Val Thr Gin 
340 345 350 

Ser Val Gin Gly Leu Gin Ala Ser Ser Gin Ser Val Gin Tyr Pro Ala 
355 360 365 

Val Ser Phe Pro Pro Gin His Leu Leu Pro Val Ser Pro Thr Gin His 
370 375 380 

Phe Pro Met Arg Asp Asp Val Ala Thr Gin Phe Gly Gin Met Thr Leu 
385 390 395 400 
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Ser Arg Gin Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val 
405 410 415 

Tyr Pro Ser Ser Leu Met Pro Gin Pro Ala Gin Gin Pro Ser Tyr Val 
420 425 430 

lie Ala Ser Thr Gly Gin Gin Leu Pro Thr Gly Gly Phe Ser Gly Ser 
435 440 445 

Gly Pro Pro He Ser Gin Gin Val Leu Gin Pro Pro Pro Ser Pro Gin 
450 455 460 

Gly Phe Val Gin Gin Pro Pro Pro Ala Gin Met Pro Val Tyr Tyr Tyr 
465 470 475 480 

Pro Ser Gly Gin Tyr Pro Thr Ser Thr Thr Gin Gin Tyr Arg Pro Met 
485 490 495 

Ala Pro Val Gin Tyr Asn Ala Gin Arg Ser Gin Gin Met Pro Gin Ala 
500 505 510 

Ala Gin Gin Ala Gly Tyr Gin Pro Val Leu Ser Gly Gin Gin Gly Phe 
515 520 525 

Gin Gly Leu He Gly Val Gin Gin Pro Pro Gin Ser Gin Asn Val He 
530 535 540 

Asn Asn Gin Gin Gly Thr Pro Val Gin Ser Val Met Val Ser Tyr Pro 
545 550 555 560 

Thr Met Ser Ser Tyr Gin Val Pro Met Thr Gin Gly Ser Gin Gly Leu 
565 570 575 

Pro Gin Gin Ser Tyr Gin Gin Pro He Met Leu Pro Asn Gin Ala Gly 
580 585 590 

Gin Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr 
595 600 605 

Pro Pro Thr Pro Gin Asn Asn Leu Arg Leu He Gly Pro His Cys Pro 
610 615 620 

Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala 
625 630 635 640 

Ser Met Ser Asn Ala Gly Trp Gin Val Lys Phe 
645 650 

<210> 56 

<211> 89 

<212> PRT 

<213> Homo saplen 

<400> 56 

Met Ser Glu Gin Gly Asp Leu Asn Gin Ala He Ala Glu Glu Gly Gly 
15 10 15 

Thr Glu Gin Glu Thr Ala Thr Pro Glu Asn Gly He Val Lys Ser Glu 
20 25 30 
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Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Glu Ala 
35 40 45 

Gin Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 
50 55 60 

Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 
65 70 75 80 

Gly Glu Ser Leu Gin Asp Gin Thr Leu 
85 

<210> 57 

<211> 88 

<212> PRT 

<213> Mus musculus 

<400> 57 

Met Ser Glu Gin Gly Gly Leu Thr Pro Thr lie Leu Glu Glu Gly Gin 
15 10 15 

Thr Glu Pro Glu Ser Ala Pro Glu Asn Gly lie Leu Lys Ser Glu Ser 
20 25 30 

Leu Asp Glu Glu Glu Lys Leu Glu Leu Gin Arg Arg Leu Ala Ala Gin 
35 40 45 

Asn Gin Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys Leu 
50 55 60 

Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Ser Gly Gly 
65 70 75 



80 



Glu Ser His Gin Asp Gin Thr Leu 
85 



<210> 58 

<211> 4462 

<212> DNA 

<213> homo sapien 

<220> 

<221> CDS 

<222> (1336) (2163) 
<223> 



<400> 58 

tctctttaat ctttgcctat ccaactggta tctcagtggt ctcttaaggc aaattccttt 60 

aattttagtg acacactcat ttctactcga aacatttgcc tcattttcca tgatggacta 120 

ttttctaaaa cacccacaat atttcctaga ttcctagtgg cagttcccta agtgtgccaa 180 

aatcccccag cctgttagtg ttacttgtta tttaagcaaa atgattcaaa tcatcatttg 240 

atacatgatg gaaatcccag gctcactcac caaatttcta gcaaatattg tgtgtgcttg 300 

taagggggat gggagggcag gagaaggcct tgtatttctt ctagtcatta caagctagtg 360 

gtttttcttc ctcagtctgg aacttactcc ttgcacatac ctttttctgc actgtgccat 420 

catccacttc tctttacttc ctaactacca ccaactgaaa attatacata taaaagcttt 480 

aaacaaagtc tcttgaggct ctcaagggag tttacattac agtatagttc agcaaacaat 540 

tttaaatcaa atagtacacc tctttattct tagaattccc tctgccaaaa aagaaatcag 600 
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ctactttttt ttaaattcaa 


ggtccaactt 


tctgttgtgt 


tgctgattct 


cttctctctt 


660 


tttccaacga cttccacttc 


tctctctagt 


ttacatgtct 


ccaaacctta 


agcctctgtt 


720 


aatactttca caataagtca 


attttgccaa 


cggtttgcct 


cccctagacc 


atctaggctg 


.780 


ggcccagaac acctcatctt 


cactcccact 


gaagtgttcc 


tgaaggtcag 


ctctcactga 


840 


ccttgattct gctcccctac 


actgtcacca 


gaagctatcc 


acctatggtt 


ctaattcagt 


900 


aagtccaact ctctcacccc 


ctttttttgt 


ctcagctgtg 


tgggctttcc 


caggatggca 


960 


tgcaatggga cccctgtgcc 


atgcatattg 


taaaggaaaa 


tgcctccctc 


catgcgctac 


1020 


aaaacagcac atttatgatg 


gcactttgaa 


aagatatggg 


ttgtggtgtc 


acatattgac 


1 nnn 


aattccttgg ccagaggctt 


aacagtgcca 


gcagtgccag 


aagattaaga 


agacagcaaa 


1140 


aacagaaaag ggagaagatg 


gtgaagtagt 


tatataacat 


gagcgagaat 


gctcctgatt 


1200 


acaaagcaga gaaattgact 


ttttttctta 


gtgttttcta 


tagtcattgc 


tctatccctg 


1260 


ttctagaatt caagtcatga 


taagaatttc 


ttcacgttga 


cttcctgcat 


tgctttcaga 


1320 


cattgcaatt aaaga atg cga aga aag 
Met Arg Arg Lys 


aac etc aca gag gta aca gag ttt 
Asn Leu Thr Glu Val Thr Glu Phe 


1371 



15 10 

gtt ttc ctg gga ttc tec aga ttc cac aaa cat cac ate act etc ttt 1419 
Val Phe Leu Gly Phe Ser Arg Phe His Lys His His He Thr Leu Phe 
15 20 25 

gtg gtt ttt etc ate ctg tac aca tta act gtg get ggc aat gee ate 1467 
Val Val Phe Leu He Leu Tyr Thr Leu Thr Val Ala Gly Asn Ala He 
30 35 40 

ate atg acc ate ate tgc att gac cgt cae etc cac act cce atg tac 1515 
He Met Thr He He Cys He Asp Arg His Leu His Thr Pro Met Tyr 
45 50 55 60 

ttc ttc ctg age atg ctg get age tea aag aca gtg tac aca ctg ttc 1563 
Phe Phe Leu Ser Met Leu Ala Ser Ser Lys Thr Val Tyr Thr Leu Phe 
65 70 75 

ate att eca cag atg etc tec age ttc gta ace eag ace cag cea ate 1611 
He He Pro Gin Met Leu Ser Ser Phe Val Thr Gin Thr Gin Pro He 
80 85 90 

tec eta gca ggt tgt acc acc caa acg ttc ttc ttt gtt acc ttg gee 1659 
Ser Leu Ala Gly Cys Thr Thr Gin Thr Phe Phe Phe Val Thr Leu Ala 
95 100 105 

ate aac aat tgc ttc ttg etc aca gtg atg ggc tat gac cac tat atg 1707 
He Asn Asn Cys Phe Leu Leu Thr Val Met Gly Tyr Asp His Tyr Met 
110 lis 120 

gee ate tgc aat cce ttg aga tac agg gtc att acg age aag aag gtg 1755 
Ala He Cys Asn Pro Leu Arg Tyr Arg Val He Thr Ser Lys Lys Val 
125 130 135 140 

tgt gtc cag ctg gtg tgt gga gee ttt age att ggc ctg gee atg gca 1803 
Cys Val Gin Leu Val Cys Gly Ala Phe Ser He Gly Leu Ala Met Ala 
145 150 155 

get gtc cag gta aca tec ata ttt acc tta ect ttt tgt cac acg gtg 1851 
Ala Val Gin Val Thr Ser He Phe Thr Leu Pro Phe Cys His Thr VaX 
160 165 170 

gtt ggt cat ttc ttc tgt gac ate etc cct gtc atg aaa etc tec tgt 1899 
Val Gly His Phe Phe Cys Asp He Leu Pro Val Met Lys Leu Ser Cys 
175 180 185 

att aat ace act ate aat gag ata ate aat ttt gtt gtc agg tta ttt 1947 
He Asn Thr Thr He Asn Glu He He Asn Phe Val Val Arg Leu Phe 
190 195 200 

gtc ate ctg gtc cce atg ggt ctg gtc ttc ate tec tat gtc etc ate 1995 
val He Leu Val Pro Met Gly Leu Val Phe He Ser Tyr Val Leu He 
205 210 215 220 
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ate tec act gtc etc aag att gcc tea get gag ggt tgg aag aag acc 2043 
He Ser Thr VaX Leu Lys He Ala Ser Ala Glu Gly Trp Lys Lys Thr 
225 230 235 

ttt gcc acc tgt gcc ttc cac etc act gtg gtc att gtc cat tat ggc 2091 
Phe Ala Thr Cys Ala Phe His Leu Thr Val Val He Val His Tyr Gly 
240 245 250 

tgt get tee att gcc tac etc atg ccc aag tea gaa aac tct ata gaa 2139 
Cys Ala Ser He Ala Tyr Leu Met Pro Lys Ser Glu Asn Ser He Glu 
255 260 265 



caa gac etc ctt etc tea gtg acc taaaccatca tcactcccct gctgaaccet 2193 
Gin Asp Leu Leu Leu Ser Val Thr 



270 




275 










gttgtttaca 


gcctaaagaa 


caaggaggtc 


•aaggatgccc 


tatgcagggc 


catgggcaga 


2253 


aacatttctt 


aatgcattat 


tcctctatat 


aaatatacat 


ttagtcatag 


aaatgtgtgt 


2313 


ccttacttac 


attaaacaac 


cttacgactc 


tgtcccatgc 


agtctatgct 


gcaatgggat 


2373 


gtgeatgtct 


tgctttggta 


tatttactac 


aaaatcttag 


tetetgtttc 


catatatttc 


2433 


aaagttttgt 


ccaggcattt 


tcaactaggg 


atgtgagagg 


tcaaggagaa 


tgggcatgat 


2493 


ttttaggaaa 


gagcatccaa 


atttetagga 


tgaagaaagg 


gaettttaaa 


agtatattaa 


2553 


atatgattat 


attgtgttta 


aaaaataaaa 


agcaatgtgt 


ctcatttttg 


taatgeaatc 


2613 


tacagaaaat 


aaaactacaa 


aateatcgca 


gcaaggtaag 


gatagtcaat 


aatgatggat 


2673 


teccttgaaa 


gaaaatagta 


teagaattgt 


cagggaaaag 


tgagatgagc 


gtattaaatt 


2733 


taaaaaaaaa 


taggaaagtt 


ggaaaacact 


ggcttagtce 


ttgaaaattt 


agtctttatt 


2793 


attcaattta 


tgctaaagcc 


tttgctttta 


tccagtgtag 


tcgtcagatg 


ctggccatgg 


2853 


ccacagatca 


tacttaactc 


tcacctttct 


aatctaaatt 


ccctaattga 


attctttctg 


2913 


getgetggtt 


ctetccatgg 


gateaacttc 


tctctaatea 


ttatgaagaa 


aaattgagtt 


2973 


ggtcaagaag 


atctgtgccc 


tgttagaata 


agaaccataa 


aagetttcct 


catttgcaca 


3033 


taccatggca 


cctcctggta 


geataaagaa 


acaaaagtag 


aacaaaacaa 


acagteccag 


3093 


eagtcaggag 


tagttcagaa 


gtataattgt 


agaatcactc 


aattcaccaa 


aaaaggetaa 


3153 


eaagaaaaaa 


aaatattttt 


ccttagtaga 


ctcctttgag 


aaaaatatcc 


tttttcccag 


3213 






ggtttcatgc 


ctatacat ac 


cttctatgtg 


gttatcttgg 


"30*7 1 


gtgatgggtg 


atggtaggga 


gggtttaaga 


cttttgttga 


gtgcttcett 


ctactaaaat 


3333 


attttttgtc 


aagaatactt 


ctctgetcat 


tecctaatgc 


eatttttctt 


tttttcaaga 


3393 


cagtcatttt 


tttectttec 


ctccactgag 


aaagaatcga 


taacatccta 


aggagctcat 


3453 


etcaggtaaa 


gaatatcttt 


acagatttct 


gaattctaga 


ttggggagac 


cattgttctt 


3513 


teaaggtctg 


aceagttett 


tctaattect 


gtcttgtgtc 


tttgcaattt 


ctacatcata 


3573 


gaaaeaaggc 


ttcctaeaga 


agctgttggg 


ggctcaaagg 


ttgggtccaa 


gagtcttgga 


3633 


catgctatga 


ggtctttctt 


gacagcactc 


tcagggtcat 


cccactacag 


atetaaacct 


3693 


ctatgeccaa 


caagagtgag 


gcagattecc 


tccagaatga 


gagtgtctct 


cctgacagta 


3753 


tgaaagaacc 


cctcccactg 


ceateaaggc 


cacetgtact 


gccaactctt 


aattataacc 


3813 


tgcccttctc 


aattctctct 


ggctageaac 


tacttgtaca 


tcctcatacc 


tgtctttttg 


3873 


tgtgtagaag 


ggaagagatg 


agacagagag 


ggctgttctg 


agcaggaaag 


tacaatgatg 


3933 


gaagttgggt 


ggtgatcaaa 


cctgttctat 


cctgctccag 


tggcagtggc 


caactcccac 


3993 


tgcccattac 


tetcctttct 


acttctgcac 


acagecactc 


tccttcatta 


gttcatgaat 


4053 
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caaagacaaa 


ggtttctcca 


gtattgtctc 


tacatctaaa 


tgctgcaaca 


gcagacatac 


4113 


cacacgccac 


tcgtgcaatc 


aaaagttaaa 


tgtcacagcg 


gggcccacaa 


agagctggca 


4173 


gaaaatatga 


gtcataaatc 


ttgaaggtga 


aatatctttg 


tctctaactc 


tgttacccac 


4233 


aatggttcaa 


tcatatgtcc 


agtttttttt 


aaataaacat 


gtgtgcttct 


aatttcttct 


4293 


tgacaataag 


atgttgtctt 


acatggtgga 


tagaatagag 


gtcccagagt 


cagatgggtg 


4353 


ctgagttcac 


ttcctcactc 


agcacccatc 


agaacctttt 


tatgaatata 


atgtttttaa 


4413 


ttctgcctcg 


cttgtcatca 


aaagcatttt 


ttaatctcct 


tacccattt 




4462 



<210> 59 

<211> 276 

<212> PRT 

<213> homo sapien 

<400> 59 

Met Arg Arg Lys Asn Leu Thr Glu Val Thr Glu Phe Val Phe Leu Gly 
15 10 15 

Phe Ser Arg Phe His Lys His His He Thr Leu Phe Val Val Phe Leu 
20 25 30 

He Leu Tyr Thr Leu Thr Val Ala Gly Asm Ala He He Met Thr He 
35 40 45 

He Cys He Asp Arg His Leu His Thr Pro Met Tyr Phe Phe Leu Ser 
50 55 60 

Met Leu Ala Ser Ser Lys Thr Val Tyr Thr Leu Phe He He Pro Gin 
65 70 75 80 

Met Leu Ser Ser Phe Val Thr Gin Thr Gin Pro He Ser Leu Ala Gly 
85 90 95 

Cys Thr Thr Gin Thr Phe Phe Phe Val Thr Leu Ala He Asn Asn Cys 
100 105 110 

Phe Leu Leu Thr Val Met Gly Tyr Asp His Tyr Met Ala He Cys Asn 
115 120 125 

Pro Leu Arg Tyr Arg Val He Thr Ser Lys Lys Val Cys Val Gin Leu 
130 135 140 

Val Cys Gly Ala Phe Ser He Gly Leu Ala Met Ala Ala Val Gin Val 
145 150 155 160 

Thr Ser He Phe Thr Leu Pro Phe Cys His Thr Val Val Gly His Phe 
165 170 175 

Phe Cys Asp He Leu Pro Val Met Lys Leu Ser Cys He Asn Thr Thr 
180 185 190 

He Asn Glu He He Asn Phe Val Val Arg Leu Phe Val He Leu Val 
195 200 205 

Pro Met Gly Leu Val Phe He Ser Tyr Val Leu He He Ser Thr Val 
210 215 220 



Leu Lys He Ala Ser Ala Glu Gly Trp Lys Lys Thr Phe Ala Thr Cys 
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225 230 235 240 

Ala Phe His Leu Thr Val Val He Val His Tyr Gly Cys Ala Ser He 
245 250 255 

Ala Tyr Leu Met Pro Lys Ser Glu Asn Ser He Glu Gin Asp Leu Leu 
260 265 270 

Leu Ser Val Thr 
275 
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